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ACQUTKED RESISTANCE NPR GENES AND USES THEREOF 

Background of the Invention 
This invention relates to the fields of genetic engineering, plant biology, plant pathogen 
defense genes and their proteins, and crop protection. 

Recent advances in plant pathology have provided a basis for understanding the 
cellular and molecular genetic mechanisms by which plants defend themselves against 
pathogen attack In particular, plants are known to utilize at least two different types of 
defense mechanisms: u> the hypersensitive response {"HR") and (h) acquired resistance 
("AR"L including svstemic acquired resistance ("SAR") and local acquired resistance 
("LAR"). These defense mechanisms are discussed below. 

The Hypersensitive Response 

Plants respond in a variety of ways to pathogenic microorganisms (Lamb, Cell 
' 7 6:41 c >-422. 1094, 1 amb et aL Cell 56:2 1 5-224, 1989). One well-studied defense response 
that occurs at the site of infection is called the hypersensitive response ("HR") and involves 
rapid localized necrosis of the infected plant cells or tissue or both. The rapid death of the 
infected cells is thought to deprive invading pathogens of a sufficient nutrient supply, 
arresting pathogen growth. Cells undergoing a HR exhibit nuclear DNA fragmentation (for 
example, DN A laddering), a hallmark of apoptosis first described in animal systems, 
indicating that the HR involves active, programmed cell death (Mittler et aL, Plant Physiol. 
I0S.4S9-493. 1995, Greenberg et a!.. Celt "77- SSl-SM. 1994; Rycrson and Heath, Plant Cell 
8:393-402. 1996; Wang et aL, Plant Cell 8, 375-391 , 1996). The HR is also accompanied by 
a membrane-associated oxidative burst that results in the NADPH-dependent production of 
O, and H 2 0 . These reactive oxygen species may be directly toxic to invading pathogens or 
may be involved in the crosshnking of plant cell waiis surrounding the lesion to form a 
bamer to infection (Bradley et aL. Cell 70:21-30, 1992, Levine et aL, Cell 79:583-593, 1994) 

In the 1 950s, H.H. Flor developed a well-known genetic model that explains the 
observation that some races (strains) of a particular pathogen elicited a strong HR on a given 
cultivar of a host species, whereas other races (strains) of the same pathogen proliferated and 
caused disease (Flor. Anna Rev Phytopathol. 9:275-296. 1971'). A pathogen that elicits an 
HR is said to be avirulcnt on that host, the host is said to be resistant, and the 
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Acquired Resistance-Systemic and Local Acquired Resistance 

The HR not only blocks the local growth of an infecting pathogen, a is also thought to 
tneser additional defense responses in uninfected parts of the plant which become resistant to 
a variety of normally virulent pathogens (Enyedi et aL Cell 70:879-886, 1992; Malarny and 
Klessm, Plant J 2:643-654, 1992). This latter phenomenon is called systemic acquired 
resistance (SAR) and is thought to be the consequence of the concerted activation of many 
genes that are often referred to as pathogenesis-related {"PR") genes. The biological 
functions of many of these PR genes remain unknown; however, a large body of 
physiological, biochemical, and molecular evidence suggests that particular PR genes play a 
direct role in conferring resistance to pathogens. For example, some PR genes encode 
ehitinases and P 1 ,3-giucanases which directly inhibit pathogen growth in vitro (Mauch et al.. 
Plant Physiol. 88:936-942, 1988; Ponstem et al.. Plant Physiol 104:109-1 18, 1994; 
Schlumbaum et al.. Nature 324:365-367. 19S6; Sela-Buurlage et aL, Plant Physiol. 
101:857-863, 1993; Terras et aL, J. Biol. Chem. 267:15301-15309. 1992; Woloshuk et aL, 
Plant Cell 3:619-628, 1991 ). In addition, constitutive expression in transgenic plants of PR 
genes has been shown to decrease disease susceptibility in a limited number of cases 
(Alexander et aL, Proc Natl. Acad. Sci. USA 90:7327-7331, 1993; Liu et aL, Proc. Natl. 
Acad. Sci. USA 91 1888-1892, 1994; Terras et aL, Plant Cell 7:573-588, 1995; Zhu et al., 
Bio/Technology 12:807-812, 1994). 

SAR was originally defined by Ross ( Vimlng\' 14 340-358. 1961 ). who demonstrated 
that tobacco became resistant to infection hy a number oi vinjses after a primary inoculation 
with an avirulent strain of tobacco mosaic virus. Subsequently, it was demonstrated that 
SAR could also be elicited by other viruses, bactena, and fungi, and that the resistance 
induced by any particular pathogen was effective against a broad spectrum of viral, bacterial, 
and fungal diseases (Cameron et aL, P/aru J. b :/i:»-725, 1994, Cruikshank and Mandryk, J. 
Aust Inst. Agnc. Sci. 26:369-372, 1960; Dempsey et al., Phytopathology 83 : 1 02 1 - 1 029, 
1993; Hecht and Bateman, Phytopathology 54:523-530, 1964; Kuc, BioScience 39:854-860, 
1982, Lovrekovich et aL, Phytopathology 58: 1034-1035, 1968; Mauch-Mani and Slusarenko, 
Mol Plant-Microhe Interact. 7:378-383, 1994; Uknes et aL, Xfol. Plant-Microbe Interact. 
6 692-6^8. 1 ( W> 

Another acquired plant defense response that shares many features with SAR is 
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so-cal!ed local acquired resistance or "LAR " LAR develops m the direct vicinity of a 
successfully proliferating pathogen to block further spread of the pathogen and to thwart the 
occurrence of secondary infections. The same set of PR proteins is believed to be involved in 
conferring resistance by both LAR and SAR. and. as described below, the same signalling 
molecules also appear to be required for the onset of both responses 

Certain chemicals, such as salicylic acid (SA ). 2,^-dichloroisonicotinic acid (INA>. 
and benzo< 1 ,2,3)thiadiazole-7-carbothioic acid S-methyl ester (BTH) have been shown to 
induce SAR or LAR or both when applied exogenously to plants (White. Virolog\' 
99.410-412, 1979; Metraux et al.. Science 250:1004-1000. 1991 . Gorlach et a!.. Plant Cell 
8:629-643, 1 996). Moreover, several lines of evidence indicate thai endogenously produced 
SA is involved in the signal transduction pathway! s) coupling HR with the onset of SAR. In 
tobacco and cucumber, an increase in SA concentration has been observed after an avirulent 
pathogen infection when accompanied by the establishment of SAR (Goodman and Plurad, 
PhysioL Plant. Pathol 1:11-16. 1971; Malamy et al.. Science 250:1002-1004, 1990; Metraux 
et al.. Science 250:1004-1006, 1990; Rasmussen et al.. Plant Physiol 97:1342-1347, 1991) 
The accumulation of SA is also associated with the subsequent induction of genes including 
those encoding PR proteins (Van Loon and Van Kammen. Virology- 40: 1 99-2 1 1, 1970; Ward 
et al.. Plant Cell 3:1085-1094, 1991; Yalpam et al.. Plant Cell 3:809-818, 1991 ). In tobacco 
and Arabidopsis, exogenously applied SA can induce the accumulation of PR mRNAs, which 
is a characteristic of SAR (Uknes et al.. Plant Cell 4:645-050. 1992; Ward et al.. Plant Cell 
3:1085-1094, 1991 ; White, Virology 99:410-412. 1979). 

These results have led to the hypothesis that one of the consequences of pathogen 
infection is the accumulation of SA in vivo, which induces the expression of a set of proteins 
that act to limit further infection of the host (Ward et al.. Plant Cell 3 1085-1094, 1991 ) 
Direct support for this hypothesis has come from the observation that transgenic tobacco or 
Arabidopsis plants that express a bacterial gene encoding a salicylate hydroxylase are unable 
to accumulate SA and, consequently, do not exhibit either SAR or LAR (Gaffhey et al.. 
Science 261 :754-756, 1993). Thus, SA is thought to be required in vivo for the establishment 
of SAR and LAR, and, as described above, PR gene products appear to participate directly in 
conferring pathogen resistance. 
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Summary of the Invention 
In uenerai. the invention features an isolated nucleic acid molecule including a 
sequence encoding an acquired resistance ( AR) polypeptide, wherein the acquired resistance 
polypeptide is at least 40% (and preferably 50%, 70%. 80%, or 90%) identical to the amino 
5 acid sequence of Fig. 5 (SEQ ID NO:3> or Fig. 7B (SEQ ID NO: 14). Preferably, such a 
nucleic acid molecule encodes an acquired resistance polypeptide that mediates the 
expression of a pathogenesis-related polypeptide. In another preferred embodiment, the 
acquired resistance polypeptide includes an ankynn-repeat motif. 

Nucleic acid molecules of the invention are derived from any plant species, including, 

I * i without limitation, angiospcrms ( for example, dicots and monocots) and gymnosperms 

Exemplary plants from which the nucleic acid may be derived include, without limitation, 
sugar cane, wheat, nee, maize, sugar beet, potato, barley, manioc, sweet potato, soybean, 
sorghum, cassava, banana, grape, oats, tomato, millet, coconut, orange, rye, cabbage, apple, 
watermelon, canola, cotton, carrot, garlic, onion, pepper, strawberry', yam, peanut, onion, 

1 5 bean, pea, mango, and sunflower. Preferred nucleic acid molecules are derived from 

cruciferous plants, for example, Arabidopsis thaliana. Examples of cruciferous acquired 
resistance molecules are shown in Fig. 4 (NPR genomic DNA; SEQ ID NO:l ) and Fig. 5 
(NPR cDNA; SEQ ID NO:2). Other preferred nucleic acid molecules are derived from 
solanaceous plants, for example. Nicotiana glutinosa. An example of such a solanaceous 

20 acquired resistance molecule is shown in Fig. 7 A (SEQ. ID NO: 13). 

In another aspect, the invention features an isolated nucleic acid molecule (for 
example, a DNA molecule) that encodes an acquired resistance polypeptide that specifically 
hvbndizes to a nucleic acid molecule that includes the nucleic acid sequence of Fig. 4 (NPR 
genomic DNA; SEQ ID NO:l), Fig. 5 (NPR cDNA; SEQ ID NO:2), or Fig. 7A (SEQ ID 

25 NO:! 3). Preferably, the specifically hybridizing nucleic acid molecule encodes an acquired 
resistance polypeptide that mediates the expression of a pathogcncsis-related polypeptide. In 
another preferred embodiment, the specifically hybridizing nucleic acid molecule encodes an 
acquired resistance polypeptide including an ankynn-repeat motif. In yet other preferred 
embodiments, the specifically hybridizing nucleic acid molecule complements an acquired 

3'> resistance mutant (tor example, an Arabidopsis npr mutant) I he invention also features an 
RNA transcript having a sequence complementary to any of the isolated nucleic acid 
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molecules described above. 

In related aspects, the invention further features a ceil or a vector ( for example, a plant 
expression vector), each of which includes an isolated nucleic acid molecule of the invention 
In preferred embodiments, the cell is a bacterium (for example, E coli or Mrohactcnum 
5 tumefaciens) or is a plant cell (for example, is a cell from any of the crops listed above). 
Such a plant cell has an increased level of resistance against a disease caused by a plant 
pathogen (for example. Phyiophthora* Pcronospora, or Pseuciomonas). In yet another 
preferred embodiment, the isolated nucleic acid molecule of the invention is operably linked 
to an expression control region that mediates expression of a polypeptide encoded by the 

I'i nucleic acid molecule. For example, the expression control region is capable of mediating 

constitutive, inducible ( for example, pathogen- or wound- inducible), or eel i- or tissue-specific 
gene expression. The invention further features a cell (for example, a bacterium such as E 
colt or Agrobactcnum tumefaciens, or a plant cell) which contains the vector of the invention. 
In still another aspect, the invention features a transgenic plant including any of the 

15 above nucleic acid molecules of the invention integrated into the genome of the plant, 
wherein the nucleic acid molecule is expressed in the transgenic plant. In addition, the 
invention features seeds and cells from such transgenic plants. For example, such transgenic 
plants may be produced according to conventional methods using any of the above crop 
plants. 

20 In yet another aspect, the invention features a substantially pure acquired resistance 

polypeptide including an amino acid sequence that has at least 40% (and preferably. 50%, 
60%, 70%, 80% or 90%) identity to the amino acid sequence of 

Fia. 5 (SEQ ID NO:3) or Fig. 7B (SEQ ID NO: 14). Preferably, the acquired resistance 
polypeptide mediates the expression of a pathogenesis-related polypeptide In other preferred 

25 embodiments, the acquired resistance polypeptide includes an ankyrm-repeat motif or a G- 
protein coupled receptor motif. Such acquired resistance polypeptides are derived from any 
plant species, for example, those crop plants mentioned above. In preferred embodiments, 
the polypeptide of the invention is derived from a cruciferous species, for example, 
Arabidopsis thahana, or from a solanaceous species, for example, Nicoiiana gluunosa 

}j) in a related aspect, the invention also features a method of producing an acquired 

resistance polypeptide. The method involves: (a) providing a cell transformed with a nucleic 
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acid molecule of the invention positioned for expression in the cell; lb) cultunng the 
transformed cell under conditions for expressing the nucleic acid molecule, and (c) 
recovering the acquired resistance polypeptide The invention further features a 
recombinant acquired resistance polypeptide produced by such expression of an isolated 
nucleic acid molecule of the invention, and a substantially pure antibody that specifically 
recognizes and binds to an acquired resistance polypeptide or a portion thereof. 

In another aspect, the invention features a method of providing an increased level of 
resistance against a disease caused by a plant pathogen in a transgenic plant. The method 
involves - (a) producing a transgenic plant cell including the nucleic acid molecule of the 
invention integrated into the genome of the transgenic plant cell and positioned for 
expression in the plant cell; and (b) growing a transgenic plant from the plant ceil wherein the 
nucleic acid molecule is expressed in the transgenic plant and the transgenic plant is thereby 
provided with an increased level of resistance against a disease caused by a plant pathogen. 

In another aspect, the invention features methods of isolating an acquired resistance 
eene or fragment thereof The first method involves: (a) contacting the nucleic acid molecule 
of the invention or a portion thereof with a preparation of DN A from a plant cell under 
hybridization conditions providing detection of DN A sequences having 40% or greater 
sequence identity to the nucleic acid sequence of Fig. 4 (SEQ ID NO:l), Fig. 5 (SEQ ID 
NO;2), or Fig. 7 A (SEQ ID NO: 1 3); and (b) isolating the hybridizing DNA as an acquired 
resistance gene or fragment thereof. The second method involves: (a) providing a sample of 
plant cell DNA, (b) providing a pair of oligonucleotides having sequence homology to a 
region of a nucleic acid molecule of the invention; (c) contacting the pair of oligonucleotides 
with the plant cell DNA under conditions suitable for polymerase chain reaction-mediated 
DNA amplification; and (d) isolating the amplified acquired resistance gene or fragment 
thereof 

In preferred embodiments of the second method, the amplification step is earned out 
usine a sample of cDNA prepared from a plant cell In addition, the pair of oligonucleotides 
used in the second method are based on a sequence encoding an acquired resistance 
polypeptide, wherein the acquired resistance polypeptide is at least 40% (and preferably 50%, 
60%, T)%, 80%, or 90%) identical to the amino acid sequence of Fig. 5 (SEQ ID NO: 3) or 
Fig ^3 (SEQ ID NO:14) 
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By "acquired resistance" gene or "AR" gene is meant a gene encoding a polypeptide 
capable of triggering a plant acquired resistance response (lor example, a systemic acquired 
resistance ( SAR) or ioca! acquired resistance response ( LAR )) in a plant cell or plant tissue 
This response may occur at the transcriptional level or it may be enzymatic or structural in 
nature. AR genes may be identified and isolated from any plant species, especially 
agronomically important crop plants, using any of the sequences disclosed herein in 
combination with conventional methods known in the an. 

Bv "polypeptide" is meant any chain of amino acids, regardless of length or post- 
transiational modification ( for example, glycosylation or phosphorylation ). 

By "pathogenesis-related" polypeptide or "PR" polypeptide is meant a polypeptide 
that is expressed in conjunction with the establishment of SAR or l.AR. Exemplar/ PR 
proteins include, without limitation, chitinase, PR-la, PR1, PR5, GST (glutathione-S- 
transferasci, and P- 1 ,3 glucanase, osmotin, thionm, glycine-nch proteins (GRPs), 
phenylalanine ammonia lyase (PAL), and lipoxygenase (LOX). 

By M ankvnn-repeat M motif is meant a consensus motif that is found in a wide variety 
of proteins that are capable of mediating protein-protein interactions. Ankynn-repeat motifs 
are described in Michaely and Bennett (Trends in Cell Biology 2:127-129, 1992) and Bork 
{Proteins: Structure, Function, and Genetics 17:363-374, 1993). 

By "substantially identical" is meant a polypeptide or nucleic acid exhibiting at least 
40%, preferably 50%. more preferably 80%, and most preferably 90%, or even 95% 
homology to a reference amino acid sequence (for example, the amino acid sequence shown 
in Fig. 5 (SEQ ID NO:3) or Fig. 7B (SEQ ID NO: 14)) or nucleic acid sequence (for example, 
the nucleic acid sequences shown in Fig. 4, or Fig. 5, or Fig. 7A. SEQ ID NOS: 1, 2, and 13, 
respectively). For polypeptides, the length of comparison sequences will generally be at least 
If) ammo acids, preferably at least 20 amino acids, more preferably at least 25 amino acids, 
and most preferably 35 amino acids. For nucleic acids, the length of comparison sequences 
will generally be at least 50 nucleotides, preferably at least 60 nucleotides, more preferably at 
least 75 nucleotides, and most preferably 1 10 nucleotides 

Sequence identity is typically measured using sequence analysis software (for 
example, Sequence Analysis Software Package of the Genetics Computer Group, University 
of Wisconsin Biotechnology Center. 1710 University Avenue. Madison. WI 53705. BLAST. 
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or PILEL'P PRETTVBOX programs). Such software matches identical or similar sequences 
bv assienins degrees of homology to various substitutions, deletions, and- or other 
modifications. Conservative substitutions typically include substitutions within the following 
groups: glycine alanine; valine, isoleueine, leucine; asparuc acid, glutamic acid, asparagine. 

5 elutamme, serine, threonine, lysine, argimne; and phenylalanine, tyrosine 

By a "substantially pure polypeptide" is meant an AR polypeptide ( for example, an 
NPR polypeptide such as NPR1 ) that has been separated from components which naturally 
accompany it. Typically, the polypeptide is substantially pure when it is at least 60%, by 
wemht, free from the proteins and naturally-occurring organic molecules with which it is 

() naturally associated. Preferably, the preparation is at least 75%, more preferably at least 

00%. and most preferably at least 99%. by weight, an AR polypeptide A substantially pure 
AR polypeptide may be obtained, for example, by extraction from a natural source (for 
example, a plant cell); by expression of a recombinant nucleic acid encoding an AR 
polypeptide; or by chemically synthesizing the protein. Punty can be measured by any 

5 appropriate method, for example, column chromatography, polyacrylamide gel 
electrophoresis, or by HPLC analysis. 

By "derived from" is meant isolated from or having the sequence of a naturally- 
occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or combination thereof)- 

By "isolated DNA" is meant DNA that is free of the genes which, in the naturaily- 

0 occurring genome of the organism from which the DNA of the invention is derived, flank the 
eene The term therefore includes, for example, a recombinant DNA that is incorporated into 
a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote; or that exists as a separate molecule (for example, a cDNA or a 
genomic or cDNA fragment produced by PCR or restriction endonuclease digestion) 

s independent of other sequences. It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequence. 

By "specifically hybridizes" is meant that a nucleic acid sequence is capable of 
hybridizing to a DNA sequence at least under low stringency conditions as described herein, 
and preferably under high stringency conditions, al^o as described herein. 

0 Bv "transformed cell" is meant a cell into which (or into an ancestor of which) has 

been introduced, bv means of recombinant DNA techniques, a DNA molecule encoding (as 
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used herein) an AR polypeptide. 

Bv "positioned for expression" is meant that the DNA molecule is positioned adjacent 
to a DNA sequence which directs transcription and translation of the sequence (i.e., facilitates 
the production of, for example, an AR polypeptide, a recombinant protein, or an RNA 
molecule ). 

By "reporter gene" is meant a gene whose expression may be assayed; such genes 
include, without limitation, P-g!ucuronidase (GUSh lucifcrase. chloramphenicol 
transacetvlase (CAT), green fluorescent protein (GFPf fi-galactosidasc. herbicide resistant 
genes and antibiotic resistance genes. 

By "expression control region" is meant any minimal sequence sufficient to direct 
transcription. Included in the invention arc promoter elements thai are sufficient to render 
promoter-dependent gene expression controllable for cell-, tissue-, or organ-specific gene 
expression, or elements that are inducible by external signals or agents (for example, light-, 
pathogen-, wound-, stress-, or hormone-mducibie elements or chemical inducers such as SA 
or INA), such elements may be located in the 5' or 3' regions of the native gene or engineered 
into a transgene construct. 

By "operably linked" is meant that a gene and a regulator,' sequence(s) are connected 
in such a way as to permit gene expression when the appropriate molecules (for example, 
transcriptional activator proteins) are bound to the regulatory sequence(s). 

By "plant cell" is meant any self-propagating cell bounded by a semi-permeable 
membrane and containing a plastid. Such a cell also requires a cell wall if further 
propagation is desired. Plant cell, as used herein includes, without limitation, algae, 
cvanobactena, seeds, suspension cultures, embryos, menstematic regions, callus tissue, 
leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 

By "crueller" is meant any plant that is classified within the Cruciferae family. The 
Cruci ferae include many agricultural crops, including, without limitation, rape (for example, 
Hrassica campestns and Brassica napus), broccoli, cabbage, brussel sprouts, radish, kale, 
Chinese kale, kohlrabi, cauliflower, turnip, rutabaga, mustard, horseradish, and Arabidopsis. 

By "transgene" is meant any piece of DN A \s hich is inserted by artifice into a cell, 
and becomes part of the genome of the organism which develops from that cell. Such a 
transgene may include a gene which is partly or entireU heterologous ( i.e.. foreign) to the 
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transgemc organism, or may represent a gene homologous to an endogenous gene of the 
organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is inserted 

by artifice into a cell and becomes pan of the genome of the organism which develops from 
5 that cell. As used herein, the transgenic organisms are generally transgenic plants and the 

DNA (transgene) is inserted by artifice into the nuclear or piastjdic genome. A transgenic 

plant according to the invention may contain one or more acquired resistance genes. 

By "pathogen" is meant an organism whose infection of viable plant tissue elicits a 

disease response in the plant tissue. Such pathogens include, without limitation, bacteria, 
in mycoplasmas, fungi, insects, nematodes, viruses, and viroids. Plant diseases caused by these 

pathogens are described m Chapters i 116 of Agrios, Plant Pathology. 3rd ed , Academic 

Press, Inc., New York, 1988. 

Examples of bacterial pathogens include, without limitation, Erwima (for example, E 

carotovnra). Pseudomonas (for example, P. svringac) y and Xanthomonas (for example, X. 
I 5 campepestris and X. oryzae). 

Examples of fungal disease-causing pathogens include, w ithout limitation, Alternaria 

(for example, A. hrassicoia and A.soiani). Ascochyta (for example, A. pisi) % Botrytis (for 

example, B. cinerea), Cercospora (for example, C. kikuchu and C. zaea-maydis), 

Colletotnchum sp. (for example, C. lindemuthianum), Diplodia (for example. D. maydis), 
2 n Er\'siphe (for example, E gramims f sp. gramims and E. gramims f.sp. hordei), Fusanum 

(for example, F ravale and f oryspomm.. F graminearum . F. soiani. F. monilforme, and F. 

roseum), Gaeumanomyces (for example, G gramims f.sp. tritici), Helminthosporium (for 

example, H. turcicunu H carhonum, and H maydis), Macrophomina (for example, M. 

phaseolma and Maganaporthe grisea). Xectria (for example, ;V. heamatoeaeca), 
25 Peronospora (tor example, P. manshurica, P. tabaana), Phoma (for example, P. belae), 

Phvmatotrichum (for example, P. ommvorum), Phytophthora (for example, P. cinnamomu P. 

cactorum^ P. phaseoli, P parasitica, P citrophthora, P. mcgasperma f.sp. sojac, and P 

infestans). Plasmopara (for example, P. viticola). Podosphaera (for example, P. leucotricha). 

Puccima (for example, P. sorghi, P strnformis* P. gramims f.sp. tritici, P asparagi, P 
3<> rccondua, and P. arachidis), lhahium (for example, P aphamdermatum), Pvrenophora (for 

example, P tritici-repentens). Pyncularia (for example. P. on-zea). Pvthium (for example, P 
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uinmun:). Rhizoctonia (for example. R solum and R cercahs). Scerottum ( for example, S 
n.dfsm. Sclcroimia (for example. .V. scterotiorum), Scntona (for example. S ivcopersici, S 
glycines. S. nodorum and 5. tntici), Thiela\ lopsis (for example, T hasicola), L'ncinula (for 
example. U necator). Veniuria ( for example. V. inacqualis). I erticiUiuni ( for example, V. 
dahhae and V alko-atrum). 

Examples of pathogenic nematodes include, without limitation, root-knot nematodes 
(for example, Meloidogyne sp such as M. incognita, M arenaria. \ f. chitwoodi, \f hapla, M 
/avamca, M graminocola, M microti la, M. graminis, and M naasi), cyst nematodes (for 
example, Heterndera sp. such as //. schachtu, II glycines, 11 sacchan. H oi^zae, H. avenue* 
II. c a/ani, II elachista. hi. goettin^iana. H ^ramims, H mediterranean II. mothi, 11. sorght, 
and // zeac, or. for example, Glohodera sp such as G rosiochicnsis and G. pallida), root- 
attacking nematodes (for example, Rot\ lenchulus reniformis, Tvlenchuvlus semipenetrans. 
Pratvlenchus brachvunis. Radopholus citrophilus, Radopholus similis , Xiphinema 
americanum, Xiphinema riyesi, Paratrichodorus minor, Heterorhahditis heliothidis. and 
Bursaphelenchus xvlophdus), and above-ground hematodes (for example, Anguina funesta. 
Anguina tritici, Ditvlenchus dipsaci, Ditvlcnchus mvceliphagus, and Aphenlenchoides 
hesseyi). 

Examples of viral pathogens include, without limitation, tobacco mosaic virus, 
tobacco necrosis virus, potato leaf roll virus, potato virus X, potato virus V, tomato spotted 
wilt virus, and tomato ring spot virus. 

By "increased level of resistance" is meant a greater level of resistance to a disease- 
causing pathogen in a transgenic plant (or cell or seed thereof) of the invention than the level 
of resistance relative to a control plant ( for example, a non-transgenic plant). In preferred 
embodiments, the level of resistance in a transgenic plant of the invention is at least 20% (and 
preferably 30% or 40%) greater than the resistance of a control plant. In other preferred 
embodiments, the level of resistance to a disease-causing pathogen is 50% greater, 60% 
greater, and more preferably even 75% or 90% greater than a control plant; with up to 100% 
above the level of resistance as compared to a control plant being most preferred. The level 
of resistance is measured using conventional methods. For example, the level of resistance to 
a pathogen may be determined by comparing physical features and characteristics (for 
example, plant height and weight, or by comparing disease symptoms, for example, delayed 
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lesion development, reduced iesion size, leaf wilting and curling, water-soaked spots, and 
discoloration of cells) of transgenic plants 

Bv "detectably-labelled" is meant any direct or indirect means for marking and 
identifying the presence of a molecule, for example, an oligonucleotide probe or primer, a 
gene or fragment thereof, or a cDNA molecuie or a fragment thereof. Methods for 
detectably-labelling a molecule are well known in the art and include, without limitation, 
radioactive labelling (for example, with an isotope such as - ,2 P or * S S) and nonradioactive 
labelling I for example, chemiluminescent labelling, for example, fluorescein labelling). 

By "purified antibody" is meant antibody which is at least 60%, by weight, free from 
proteins and naturally-occurring organic molecules with which it is naturally associated. 
Preferably, the preparation is at leas: "V J „, more preferably Wo. and most preferably at least 
l )9°-o. by weight, antibody, for example, an acquired resistance polypeptide-specific antibody. 
A purified AR antibody may be obtained, for example, by affinity chromatography using a 
recombinant!) -produced acquired resistance polypeptide and standard techniques. 

Bv "specifically binds" is meant an antibody which recognizes and binds an AR 
protein but which does not substantially recognize and bind other molecules in a sample, for 
example, a biological sample, which naturally includes an AR protein such as NPR. 

As discussed above, fundamental acquired resistance genes that are responsible for 
providing plants with the ability to protect themselves against pathogens have been identified. 
Accordingly, the invention prov ides a number of important advances and advantages for the 
protection of plants against their pathogens. For example, by providing AR genes as 
described herein that are readily incorporated and expressed in all species of plants, the 
invention facilitates an effective and economical means for in-plant protection against plant 
pathogens. Such protection against pathogens reduces or minimizes the need for traditional 
chemical practices (for example, application of fungicides, bactericides, nematicides, 
insecticides, or vincides) that are typically used by farmers for controlling the spread of plant 
pathogens and providing protection against disease-causing pathogens. In addition, because 
plants expressing one or more acquired resistance gene(s) described herein are less vulnerable 
to pathogens and their diseases, the invention further provides for increased production 
efficiency, as well as for improvements in quality and yield of crop plants and ornamentals 
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Thus, the invention contnbuies to the production of high quainv and high yield agricultural 
products: tor example, fruits, ornamentals, vegetables, cereals and field crops having reduced 
spots, blemishes, and blotches that are caused by pathogens, agricultural products with 
increased shelf-life and reduced handling costs; and high quality and yield crops for 
5 agricultural (for example, cereal and field crops), industrial (for example, oilseeds), and 

commercial f for example, fiber crops) purposes. Furthermore, because the invention reduces 
the necessity for chemical protection against plant pathogens, the invention benefits the 
environment where the crops are grown. Genetically-improved seeds and other plant 
products that are produced using plants expressing the genes described herein also render 

10 fanning possible in areas previously unsuitable for agricultural production The invention 
further provides a means for mediating the expression of pathogenesis-related proteins, for 
example, chitmase and GST, that confer resistance to plant pathogens. For example, 
transgenic plants constitutively producing an AR gene product are capable of activating PR 
gene expression, which in turn confers resistance to plant pathogens. Collective PR gene 

] 5 expression that is mediated by the AR gene product obviates the need to express indiv idual 
PR genes as a means to promote plant defense mechanisms 

The invention is also useful for providing nucleic acid and ammo acid sequences of an 
AR gene that facilitates the isolation and identification of AR genes from any plant species. 
Other features and adv antages of the invention will be apparent from the following 

20 description of the preferred embodiments thereof, and from the claims 

Detailed Description 
The drawings will first be described. 
Drawings 

25 Fig. 1 is a schematic illustration showing the physical map of\-l. thaliana 

chromosome I and the position of NPR1 . 

Fig. 2 A is a photograph of a Northern blot analysis showing the expression of the PR- 

1 gene in wild type plants(Col-0, lanes 1-3). nprI-2 mutant plants* lanes 4-6), nprI-2 

transformants with a noncomplcmenting cosmid (m305-2-"\ lanes 7-9), and nprl-2 
30 transformants with complementing cosmids (21 A4-P5-1 . lanes 10-12 and 21 A4-6-1 -1, lanes 

13-15). RNA samples were prepared from fifteen-day old seedlings grown on MS media 
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i lanes 1, 4, 10, and 13), MS media with 0.1 m\l INA (lanes 2, 5. S, 11, and 14), and MS 
media with U. 1 mM SA ( lanes 3, 6, 9, 1 2, and 1 5 ) 

Fig. 2B is a series of photographs showing disease symptoms (top panels) and BGL2- 
GL'S expression (bottom panels) induced by Psm ES4326 on wild-type (left panels), nprl-I 
(middle panels), and an nprl-l transformant with a complementing cosmid (21 A4-4-3-1, 
right panels). 

Fig. 2C is a panel of graphs show ing the grow th of Psm ES4326 in wild-type, nprl -2, 
and an nprl -2 transformant w ith a complementing cosmid ( 2 1 A4-P5- 1 ). Error bars represent 
v5°o confidence limits of log-transformed data as described by Sokal and Rohlf (Biometry 2d 
ed., vV.H. Freeman and Company. New York. 1981 ). 

Fig 21) is a panel of bar graphs show ing the disease rating of P. parasiuca NOCO 
infection in wild type, nprl -2. and an nprl -2 transformant with a complementing cosmid 
(2 1 A4-P5- 1 ). The disease rating scales are defined as follows: 0, no conidiophores on the 
plant; 1. no more than 5 conidiophores per infected leaf; 2. 3-20 conidiophores on a few 
infected leaves, 3, 6-20 condiophores on most infected leaves; 4, 5 or more conidiophores on 
all infected leaves; 5, 20 or more conidiophores on all infected leaves. 

Fig. 3 is a schematic illustration showing the restriction map of the 7.5-kb region 
containing the NPR1 gene. 

Fig. 4 is a schematic illustration showing the genomic sequence of the 7.5-kb region 
containing the acquired resistance nucleic acid sequence of the gene termed NPRJ (SEQ ID 
N O : ! ) fro m A rci b i cfops i s (hahana 

Fig. 5 is a schematic illustration showing the cDNA sequence (SEQ ID NO:2) and 
deduced amino acid sequence (SEQ ID NO: 3) of the acquired resistance protein termed 
NPR1 from Arabidopsis thahana. Amino acids numbered 262-289. 323-37 L and 453-469 
show homology to a mouse ankynn protein, an ankyrin-repeat motif, and a G-protein coupled 
receptor motif, respectively. 

Fig. 6A is a schematic illustration showing the alignment of the NPR1 amino acid 
sequence with mouse ankynn 3 ( ANKB). Two regions producing the highest scoring pairs 
(smallest sum probability - 0.0004) generated using a BLAST search are shown. The 
identical and similar amino acids ( H are highlighted in bold, circled letters. 

Fig 6B is a schematic illustration showing the alignment of the ankynn repeats in 
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N?R1 with the ankynn repeat consensus derived from Michaels and Bennett (Trends in Cell 
Bioloey 2:127-129, 1992) and Bork (Proteins- Structure. Function, and Genetics 17:363-374, 
1993). Since there are a few non-overlapping amino acids between the two derived 
consensus sequences, both are presented. In the consensus derived from Bork, the conserved 
5 features are indicated: t. turn-like or polar; o, S T; h. hydrophobic; capitals, conserved amino 
acids. Those amino acids identical to the consensus are highlighted in bold, circled letters. 

Fm. 7A is a schematic illustration showing the cDNA sequence 
(SEQ ID NO: 13) of an NPR1 hornolog isolated from Nicottana glutinosa 

Fig. 7 B ts a schematic illustration showing the deduced ammo acid sequence of the 
10 NPR 1 hornolog of hicotiana glutinosa (SEQ ID NO: 14) shown in Fig. 7 A. 

Fig. 8A is a graph illustrating the dosage effect of NPR1 on the resistance of 
transgenic Arabidopsis to the bacterial pathogen, Psm ES4326. Eight samples were taken at 
each time point for the Psm ES4326 infection (initial inoculant OD b00 =0.001 ). Error bars 
represent 95% confidence limits of log-transformed data. Colony forming unit is designated 
15 as cfu. 

Fig. 8B is a histogram showing the dosage effect of NPR1 on the resistance of 
transgenic Arabidopsis to the fungal pathogen, Peronspora parasitica NOC02. A spore 
suspension (3xl0 4 spores/mL) of P. parasitica was used for these infection studies, and the 
number of comdiophores on each plant was counted seven days after infection. The data 
20 were analyzed using WUcoxon two-sample tests. At the 95% confidence level, significant 
difference in growth was present between all pairs of samples except ColNPRl-M and 
ColNPRl-H, and Col and ColNPRl-L. 

Fig. 9A are photographs showing the restoration of inducible BGL2-GUS expression 
in 35S-NPR I -GFP transgenic plants. Seedlings were grown on either MS or MS-INA (0.1 
25 mM) media for fourteen days and stained for GUS activity. 

Fig. 9B is a photograph showing the complementation of the SA sensitivity in the 
Arabidopsis nprl mutant by 35S-NPR1-GFP Seedlings were grown for eleven days on MS 
SA (0.5 mM) medium. The NPR1-GFP transgene restored normal growth to nprl on SA. 
The mGFP transgene, however, was unable to restore normal growth to nprl . Note that the 
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NPRl-GFP line used was in the T : generation The observed 3:1 segregation ratio indicated 
that the transgenic plants contained a single locus KPR1 -GFP insertion. 

Fii^ l )C is a histogram showing the restoration of P parasitica resistance to the T\ 
NPR1-GFP transformants. IN A treatment (0.65 mM) was earned out seventy-two hours 
5 prior to infection with a spore suspension (3x1 (J 4 spores/mL). The disease symptoms were 
scored seven days after the infection with respect to the number of conidiophores on the 
plant. The disease rating scale is defined as: 0, no conidiophores on the plant; 1, no more than 
5 conidiophores per infected leaf; 2, 6-20 conidiophores on a few infected leaves; 3, 6-20 
conidiophores on most of the infected leaves; 4, 5 or more conidiophores on all infected 

] « • leaves: S. 20 or more conidiophores on ail infected leaves. Seedlings in the 0, 4, and 5 

categories were also examined for the presence of the NPR 1 -GFP transgene, and the number 
of NPR1-GFP transformants is indicated in the parenthesis. Most of the P parasitica 
resistant plants (0 category) contained the NPR1-GFP transgene; however, all of the sensitive 
plants (4 and 5 categories) were observed to segregate as non-transformants lacking the 

1 5 transgene. 

Fig 10 is a photograph showing the localization of NPR 1 -GFP in response to 
chemical activators of S AR. The transformants, containing either the NPR 1 -GFP (top and 
bottom panels) or mGFP transgene (middle panels) were grown for eleven days on MS or 
MS-INA media. GFP fluorescence was visualized by confocal microscopy in leaf mesophyll 
20 cells and guard cells. DIC is shown in the red channel and GFP is shown in the green 
channel. 

Figs. I 1 A-l 1G are a series of photographs showing the localization of NPR 1 -GFP in 
response to Psm ES4326 infection. Leaves of NPR1-GFP transformants were infiltrated on 
the left half with either Psm ES4326(Hg. 1 IB) or 10 mM MgCl 2 (Fig 1 IE) and stained for 
25 BGL2-GUS expression after three days. Pnor to GUS staining the leaves were analyzed for 
GFP localization on the infiltrated (Fig. 1 1 A and Fig. 1 ID) and the uninfiltrated (Fig. 1 1C) 
side Leaves of mGFP transformants were infiltrated with Psm ES4326 (Fig. 1 I F) or 10 mM 
MgC 1 : (Fig 1 1G) and analyzed for GFP localization. 
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Overview 

A genetic study was conducted using Arabidopsis ihahana as a model system to 
identify key elements that control the signaling pathway leading to the induction of acquired 
resistance i AR). for example, a system acquired resistance (SAR) response, to pathogen 
5 infection m plants. In wild-type Arabidopsis plants, SAR responses can be induced by 

treatment with 0. 1 mM salicylic acid (SA) or 0. 1 mM 2,6-dichloroisonicotmic acid (INA) or 
after an infection by an avirulent pathogen such as Pseudomonas sxringae pv phaseolicola 
NP3121 avrRpt? ( P s phaseolicola 3121 • avrRpiJ ). SAR is demonstrated by enhanced 
resistance to virulent pathogens, such as Pseudomonas svrmgae pv macidicola ES4326 \P.s 

10 macidicola ES432b), and by increased expression of pathogcncsis-related genes (for 

example, PR genes including PRP BGL2. and PR5). To facilitate detection of PR gene 
expression and identification of mutants that were aberrant in the SAR signaling pathway, a 
BGL2-GUS reporter gene was constructed and transformed into Arabidopsis thahana ecotype 
Columbia. This parental line containing the BGL2-GUS transgene was mutagenized by 

15 treatment of seeds with 0.3% ethyl methanesulfonate for eleven hours The M2 progeny of 
the mutagenized population were screened for the lack of BGL2-G[ ! S expression in the 
presence of the SAR-inducers SA and INA (Cao et aL Plant Gell 6: 1 583- 1 592, 1994). 

Using these techniques, the nprl-i ( nonexpresser of PR genes) mutant was isolated 
and found to have almost complete lack of expression of the BGL2-GUS reporter gene, as 

20 well as a lack of expression of the endogenous PRL BGL2. and PR5 genes in response to SA. 
INA, and avirulent pathogen treatments (Cao et al., Plant Gell 6:1583-1 592, 1 994). Further 
characterization of the nprl-1 mutant showed that mutations in the NPR1 gene completely 
blocked the induction of SAR. In the nprl-1 plants pretreatcd with SA. INA. or an avirulent 
pathogen, growth of virulent pathogens (for example, P.s maculicoia ES4326) was not 

25 inhibited, as found in the parental line carrying the wild-type NPRI gene. This finding 

demonstrated that the NPRI gene plays a key role in the signaling pathway leading to the 
establishment of SAR. 

Two additional nprl mutants, nprl-2 and nprl-3, were isolated on the basis that they 
were more susceptible to infection than wild-type plants by P s macidicola strain ES4326 
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(Glazcbrook et ai.. Generics 1 43:9" 7 ?-9S2, 1996). Genetic complementation tests showed that 
nprl-L nprl -2. and nprl -3 were allelic. 

The SPR1 gene not only controls the onset of systemic resistance, but also was found 
to affect local acquired resistance ("LAR"), the ability of plants to restrict the spread of 
5 virulent pathogen infections. In nprl mutant plants, the virulent pathogen P s. maculicola 
ES4326 grows to a greater extent and spreads further beyond the initial site of invasion than 
in the wild-type plants. The effects of the impaired SAR and LAR in nprl mutants is also 
evident when various strains of Peronospora parasitica were tested. Disease symptoms (i.e., 
downy mildew) were observed after infection by strains of P. parasitica to which the 

in u-ild-tvpe parental line of A* nhniopsis is resistant, showing the break down of the "natural" 
resistance in the nprl mutants. The effects of the nprl mutations appeared to be specific to 
the defense response. No significant morphological phenotypes were observed in three allelic 
nprl mutants, nprl-L nprl -2, nprl-3. However, when grown on medium containing a high 
concentration of SA (0 5 mM). the growth of all three nprl mutants was arrested at the 

1 5 cotyledon stage, and the seedlings were bleached. Wild-type plants were observed to grow 
normally in the presence of 0.5 mM SA. 

The phenotypes of the nprl mutants clearly demonstrated the biological significance 
of the NPR1 gene of Arabidopsis ihahana in controlling the defense response against a broad 
spectrum of pathogens. 

20 The NPR1 gene was cloned using a map-based positional cloning strategy. The 

location of NPR1 on the Arabidopsis genome was first delimited to a 7.5-kilobase (kb) region 
contained on cosmtd clones 2 1 A4-4-3- 1 , 2 1 A4-6- 1-1,21 A4- P5 - 1,21 A4-P4- 1. and 2 1 A4-2- 1 
by its ability to complement the nprl mutant. An SA-inducibie 2.0-kb RNA transcript 
encoded within this 7.5-kb region corresponding to NPR1 was identified by RNA blot 

25 analysis. Isolation of this acquired resistance gene facilitates the cloning of AR genes from 
plants of agricultural or economic importance. For example, engineering ectopic expression 
of AR genes (for example, an NPR gene) in crop plants, which is useful for providing novel 
strategies for creating plants with enhanced resistance to pathogen infection. 

There now follows a description of the cloning of an Arabidopsis AR gene, WPR 1 A 
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dcscnption is also provided of the cloning of the XPRi homoiog from Xicotiana glutinosa. 
These examples are provided for the purpose of illustrating the invention, and should not be 
construed as limiting. 

Genetic Analysis of SAR in Arabidopsis and the Isolation of nprl Mutants 
5 Using Arabidopsis ihahana. components of the signalling pathway in SAR 

downstream of SA and INA induction have been identified. Specifically, we sought 
Arabidopsis mutants that did not express PR genes in the presence of added SA or FNA. 
Because there is no visible phenotype known to be associated vsith such mutants, transgenic 
Arabidopsis plants were generated which expressed ^-glucuronidase (GUS) under the control 

10 of the Arabidopsis p- 1 ,3-glucanase (BGL2) promoter ( Dong et al., Plant CW/3:6I-72. 1991 ) 
The BGL2 gene is one of the PR genes regulated by SA (Uknes et al., Plant Cell 4:645-656, 
1992). Briefly, seed from the transgenic line (BGL2-GUS) were mutagenized with ethyl 
methanesulfonate (EMS), and the resulting mutants were screened after SA or INA treatment 
for aberrant expression of GL'S The results of these screenings showed that high levels of 

15 p-glucuromdase (GUS) activity could be assayed in a single well of a ninety-six well 

microtiter plate using a single leaf from a plant that had been grown for two weeks on plates 
containing SA or FN A. Screens were performed for Arabidopsis mutants that either 
expressed the BGL2-GUS reporter constitutively in the absence of SA or INA treatment or 
that failed to express the reporter gene following treatment with SA or INA. These screens 

20 led to the identification of a series of mutants called cpr and npr (constitutive expresser of PR 
genes and for fion-expresser of PR genes, respectively) which define genes that are involved 
both in the regulation of BGL2 specifically and SAR in general (Bowling et al., Plant Cell 
6:1845-1857, 1994;Caoet al.. Plant Cell 6 1 583- 1 592, 1994) 
Construction of BGL2-GUS Tra ns genic Arabidopsis 

25 An XbaPSphI fragment (2025 base pairs (bp)} containing 1746-bp of noncoding 

sequence upstream of the start codon of the Arabidopsis BGL2 gene was fused at the ATG 
sue to the coding region of the Escherichia coli uidA gene (referred to as the GUS gene) and 
transferred into the vector pBHOU which was then used to transform Arabidopsis ecotype 
Columbia (Valvekens et al., Proc Natl. Acad Sci I SA 85:5536-5540, 1988). Plants 
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homo/yeous for the BGL2-GCS construct were identified on the basis that progeny of these 
plants were resistant to kanamycin and the presence of the transgene that was detected using 
Southern hybridization. 

Mutagenesis of the BGL2-GL 'S Transgenic Line 
5 Mutagenesis was performed in the BGL2-GLS, BGL2-GL S transgenic line by 

exposing -36.000 seeds to 0.3% ethyl methanesulfonate for eleven hours. Seeds were sown, 
and the plants were allowed to self-fertilize to produce M : seeds, which were collected in 
twelve independent pools. 

Identification of the nprl-l Mutant 
M> The M> seeds were germinated on MS medium with the addition of 0.8% agar, 0 5 

mg/mL Mes (2-( A-morphoimoiethane-sulfonic acid), pH 5.7, 2% sucrose, 50 ug/mL 
kanamycin, and 1 00 uglmL ampicillin. Either 0.5 mM salicylic acid (S A) or 0.1 mM INA 
was added to induce systemic acquired resistance (SAR). After incubation for fifteen days, 
each seedling to be assayed was numbered, and a single leaf was then removed from each 
15 seedling and put into the corresponding sample well of a ninety-six-well microtiter plate that 
contained 1 00 of P-glucuronidase (GUS) substrate solution (50 mM Na 3 HP0 4 , pH 7.0, 10 
mM Na.EDTA, 0.1% Triton X-100, 0.1% sarkosyL 0.7 ^L/niL Pmercaptoethanol, and 0.7 
mtj/mL 4-methylumbelliferyl p-D-glucuronide). After all the samples were collected, the 
microtiter plate was placed under vacuum for two minutes to infiltrate the samples and then 
2" mcubated at 37 n C overnight. Samples were examined for the fluorescent product of GUS 
activity (4-methylumbellifone) using a long-wavelength UV light. Those seedlings which 
showed no GUS activity were identified on the MS plate and transplanted to soil for seed 
setting. This procedure was repeated in the progeny of these putative mutants to ensure that 
the mutant phenotype was heritable and to identify the homozygous mutants. Of 13,468 M : 
25 plants tested, 181 did not exhibit GL^S activity in the presence of either SA or INA. In the M, 
generation, 77 of 139 lines tested maintained a mutant phenotype for GUS activity, with 76 
nonresponsive to both SA and INA and one line nonresponsive to SA but responsive to fNA. 

Three classes of mutations were predicted to be earned by the mutants that were 
nonresponsive to SA or INA treatment: ( 1 ) mutations in iegulatory genes which not only 
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affect expression of the transgene. but also the endogenous PR genes, (2) mutations in the 
promoter of the transgene which affect the responsiveness of BGL2-GUS. but not that of the 
endoizenous PR genes to SA and IN A, and (3j mutations m the coding region of the GUS 
uene which abolish the enzymatic activity of GUS. but not the transcription of GUS mRNA. 
To distinguish betw een these classes, the expression of endogenous PR genes was analyzed 
m the NT, generation. Regulatory gene mutants should be readily distinguished in the M, 
generation by an aberrant level of expression of other SAR-related PR genes. 

RNA ge! blot analysis v>as performed with these " mutant lines to identify those 
with modified expression of PR genes The expression of the Arahuiopsis mitochondrial 
p-ATPase gene served as a control tor sample loading Among the 77 mutant lines, six were 
found to have reduced expression of the endogenous PR genes to some degree (class 1 ); three 
showed aberrant expression only in BGL2-GUS (class 2); and fourteen were found to have 
reduced GUS activity but normal transcription ot BGL2-GUS (class ?) One class 1 mutant 
(nprl-l) exhibited a dramatic reduction in expression of the GUS\ BGL2, and PR- 1 genes 
compared to the wild-type in the presence of SA or IN A. Therefore, nprl-l was selected for 
further study 

The nprl-l mutant was tested for the induction of PR-5, another PR gene that has 
been cloned in Arahuiopsis (Uknes ct aL Plant Cell 4:645-656, 1 992). and a similar 
reduction in expression was observed. The reduction in PR gene expression after SA or IN A 
treatment was quantified for nprl-l relative to the parent BGL2-GUS line (representing the 
wild-type). In nprl-L the expression of both GUS and BGL2 was ten-fold lower than that of 
the wild-type and that of PR-5 was five-fold lower. The most dramatic reduction was 
observed for PR- 1 which was twenty-fold lower than the wild-type. 

Quantitative GUS Assavs Using nprl-l 

To measure accurately the level of GUS activity, a quantitative GUS assay was 
performed on nprl-l plants and the wild-type BGL2-GUS plants grown in the presence of 
either SA or IN A, or in the absence of both. In the absence of an inducer, the background 
level of GUS activity was five-fold lower in the nprl-l mutant than in the wild-type. 
Wild-type plants grown in the presence of 0.5 mM SA showed a fifty-two-fold increase in 
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GUS activ:t\ compared to the uninduced plants, whereas in the SA-induced nprl-I plants, 
the increase :n GUS activity was only seven-fold. Moreover, the induction by 0.1 mM FN A 
was forty-eight-fbld for the wild-type versus five-fold for nprl-l . Thus, while GUS activity 
in the SA- or INA-treated nprl-l plants was somewhat induced, the activity was at most only 
sliehtlv higher than the background level of the untreated wild-type. 
Genetic Analysis of the nprl- 1 Locus 

A backcross of nprl-l/nprl-l with its wild-type parent (NPRl NPRl in the 
BGL2-GUS background) resulted in F { progeny {NPRl nprl sixteen plants were tested) 
with the same pattern of GUS staining (using 5-bromo-4-chloro- 

3-indoiyl glucuromde [XGlucl as the substrate) observed in the v\ iid-type after SA or 1NA 
treatment. GUS staining was not detected in the SA- or INA-treated nprl-l/nprl-l 
homozygous plants even after two days of incubation at 28 °C. Self-fertilization of the F, 
plants produced F 2 progeny that segregated for GUS activity, intense staining or complete 
absence of staining, which were present with a ratio of 219:64 among the 283 F 2 plants 
examined, demonstrating that the mutant phenotype is recessive and due to a single nuclear 
mutation (x : ~-0.86, P>0.1). 

SA-, INA-. and Avirulent Pathogen-Induced Protection Against Pseudomonas 
synngac pv maculicola ES4326 Infection in Wild-Type and nprl-l 

To examine whether the lack of SA- or INA-induced PR gene expression would affect 
SAR protection against a virulent pathogen infection, fifteen-day-old wild-type and nprl-l 
plants were treated with either I mM SA or 0.65 mM IN A, and two days later were exposed 
to a P.s. maculicola ES4326 bactenai suspension Significant protection was observed in the 
SA- or IN A treated wild-type plants with less than ten percent of plants showing slight 
veiiowmg Chlorotic lesions developed in about ninety percent of the untreated wild-type 
control plants not pretreated with SA or INA. However, such SA- or FN T A-mduced protection 
was not observed in nprl-l mutant plants. Chlorotic lesions were clearly seen in over ninety- 
percent of untreated and at least eighty-percent of SA- or INA-treated plants. The symptoms 
on nprl-l were also more severe than on the wild-type plants. Treatment with only 1 mM 
SA, 0 65 mM INA, or surfactant (0.01 % Silwet-^7. used for the bacterial infection) had a 
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nunirnal effect on both the wild-type and the nnrl-i plants 

The growth of P s macuhcola ES4326 was measured :n both wild-type and nprl- 1 
plants that had been treated with water, SA. or INA two days before P s macuhcola ES4326 
infection. Leaves were collected 0, 0.5, 1 .0. 2 0, and 3.0 da\s after bacterial infiltration. For 
the untreated wildtype plants, P.s. macuhcola ES4326 proliferated 10,000-fold during this 
time period. However, for SA- or INA-treated w ild-type plants, the growth of P.s. 
macuhcola ES4326 was only about ten-fold, 1000 times lower than the untreated control. A 
Student's t test of the difference between the means at the three-day time point clearly showed 
that erowth of the pathogen is inhibited in the wild-type plants treated with SA or INA 
compared to those sprayed with w ater ( P ~ 0 001 ). Such a dramatic difference in P.s. 
macuhcola ES4326 growth, which resulted from SAR protection, was not observed in the 
nprl -I plants, where a Student's / test showed no statistically difference m growth after three 
davs for all conditions (P>0.05); the growth of P s macuhcola HS4326 in nprl -I plants was 
similar for mock-treated and cither SA- or INA-treated plants. Comparing the untreated 
nprl -J plants with the untreated wild-type, the level of P.s. macuhcola ES4326 appeared to 
have reached saturation one day earlier m the mutant than in the wild-type. Moreover, the 
difference in P.s. macuhcola ES4326 growth between the SA- or INA-treated wild-type and 
nprl- 1 was 500- to 1000-fold. 

To test the response to an avirulent pathogen, the nprl-] plants were infiltrated with 
P.s. macuhcola ES4326 carrying an avirulence gene avrRpt2 as described by Dong et al. 
{Plant Cell 3:61-72, 1991 ) and Whalen et al. [Plant Cell 3:49-59, 1991). A typical HR was 
observed in these nprl -I plants as characterized by the rapid appearance of necrotic lesions, 
detection of autofluorescence in the cell wall regions of the infected cells, and inhibited 
urowth of P.s. macuhcola ES4326 avrRpt 2 . The ability of this avirulence gene to induce 
SAR in nprl -I plants was then tested. To distinguish the inducing bacterial strain from the 
challenging strain, the bean pathogen Pseudomonas svnngae pv phascohcola strain NPS3121 
(P.s. phascohcola NPS3121; (Lindgren et al.,./. Bactenol. 168:512-522, 198b)) containing 
the avrRpt 2 gene was used to induce SAR in both the nprl -1 and wild-type plants. P.s. 
phaseohcola NPS3121 by itself caused no disease symptoms or visible HR on Arabidopsis 
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eeotype Columbia, while P.s. phascohcola NPS3121 a\rRpt2 elicited a strong HR (Yu et a!.. 
Sfol Plant-Microhc Interact. 6:434-443, 1993). Three days after the inoculation, uninfected 
leaves on the same plants were challenged with the virulent pathogen P.s. maculicola 
ES4326, and the growth of P.s. maculicola ES4326 in the plants was measured. A significant 
reduction in bacterial growth was observed in the wild-type plants pre-inoculated with P.s. 
phascolicola NPS3 1 21 avrRpi 2 compared to the mock treated samples (300-fold); however, 
no difference in P.s. maculicola ES4326 growth was detected in nprl-1 plants. 

Disease Symptoms and BGL2-GUS Expression Induced by P.s. maculicola ES4326 
Infection in Wild-Type and nprl-1 

P.v maculicola ES4326 was able to establish infection in SA-. INA-, and avirulent 
pathogen-treated 

nprl-1 plants as well as in the untreated plants. The lesions formed on the untreated mutant 
plants and the untreated wild-type were further compared. For this purpose, the P.s. 
maculicola ES4326 suspension was infiltrated into four-week-old wild-type and nprl-1 
leaves. The injection was controlled so that only half of the leaf was infiltrated with the 
bacteria. This could be monitored by the soaking appearance of the hal f-leaf. Forty-eight 
hours following infiltration, chlorotic lesions were visible on the wild-type leaves. These 
lesions were normally confined to the infiltrated halves of the leaves as defined by the midnh 
vein Different lesions were observed on the nprl-1 leaves, where the lesions were more 
diffuse and often spread into the uninfected halves of the leaves. Sampling of twelve leaves 
from both wild-type and nprl-1 plants revealed significant growth of the bacteria in the 
uninoculated half of eleven nprl-1 leaves compared to none of the wild-type leaves. 

For the leaves infected with P.s. maculicola ES4326, the pattern of BGL2-GUS 
expression was examined by X-Gluc staining. In a wild-type leaf, a high level of GUS 
staining was detected in the peripheral region of the lesion. In contrast, no significant GUS 
activity was detected on the nprl-1 leaf, where the lesion was more extensive than on the 
wild-type. 

C onclusions About nprl-1 

The data described above indicates that nprl-1 harbors a trans-acung mutation(s) 
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affectmg the response to SA and INA. The possibility of nprl -I being a mutant affecting the 
uptake of exogenously applied SA or INA is ruled out by the observation that the expression 
of PR I induced by P.s. macuhcola ES4326, instead of by exogenously applied SA or INA, is 
also reduced in the nprl-l mutant. The failure of SA or INA to protect the nprl-l mutant 
5 from infection by P.s. macuhcola strain ES4326 (in contrast to the protection observed in 
wild-type plants) indicated that the nprl-l mutation blocks SA or INA induction of 
resistance. Even though the HR elicited in the nprl-l mutant by bacteria carrying the 
avirulence gene avrRptJ was similar to thai described previously in wild-type plants (Dong et 
aL Plant Cell 3:61 -72, 199! . Whalcn et aL Plant Cell 3:49-59. 1991 ), the HR-induced SAR 
10 protection against infection by the virulent pathogen P s macuhcola ES4326 was absent in 
the nprl I plants. This indicated that nprl-l is a mutation that prevents the onset of SAR 
These phenotypes of the nprl - 1 mutation indicated that the function of the wild- type NPRl 
gene is to qualitatively and quantitatively regulate the expression of SA- and IN A-responsive 
PR genes. 

1 5 Genetic analysis of the progeny of an nprl-l/npr! -1 X NPRl /NPRl backcross 

indicated that a single recessive nuclear mutation determines the "nonexpresser of PR genes" 
phenotypc of the nprl-l mutant. This also indicated that the NPRl gene acts as a positive 
regulator of SAR responsive gene induction While the gene could be a negative regulator 
which is inactivated by SAR induction, a mutation abolishing such regulation would likely be 

20 dominant. Furthermore, the fact that a single mutation (that is, nprl-l) affects the 

responsiveness of this mutant to SA-, INA-, and pathogen induction indicated that SA, INA, 
and pathogens activate a common pathway that leads to the expression of PR genes. 
Identification of the Arahidopsis nprl-2 and nprl -3 Mutants 

To identify novel Arahidopsis mutants that negatively affect the induction of SAR. an 
25 alternative mutant screening strategy was employed 

We have observed that the final density to which the virulent pathogen P.s. 
macuhcola ES4326 will grow in an Arahidopsis leaf is directly related to the dose at which 
P.s. macuhcola ES4326 was infiltrated. The observed phenotypes of two additional types of 
Arahidopsis mutants also supported this conclusion Specifically, a senes of Arahidopsis 
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mutants were identified that accumulated reduced levels of the phytoalexin called camalexin, 
a phytoalexin tha: has been round in significant quantities in Arabuiopsis (Glazebrook and 
Ausubel, Proc. Natl Acad. Sci USA 91:8955-8959, 1994; Tsuji et a!.. Plant Physiol. 
9S: 1 304- 1 309, 1992). Importantly. P.s. maculicola ES4320 formed disease lesions and grew 
5 to higher titers on some of these pad (phytoalexin deficient) mutants when inoculated at 
doses below the threshold dose required to give disease symptoms in wild-type plants. 
Similarly, nprl-1 mutants exhibited a similar enhanced susceptibility phenotype as pad 
muiants (Cao et aL Plant Cell 6:1583-1592. !994) 

Based on these findings that pad and npr mutants were more susceptible to low dose 

iu p s maculicola ES432b infection than wild-type plants, a screen was performed to isolate 
additional eds (enhanced disease susceptibility) mutants (Glazebrook et al., Genetics 
143:973-982, 1996). Two leaves of M2 generation mutagenized Arabuiopsis plants were 
infected at a dose of strain P.s. maculicola ES4326 at which wild-type plants showed very 
weak symptoms manifested as small chlorotic spots three days after infection, whereas pad 

15 and npr I mutants showed large areas of chlorosis. A total of fifteen eds mutants that 
reproducibly allowed at least one half lug more growth cf P.s. maculicola ES4326 as 
compared to wild-type were identified among 12,500 plants screened. Because some pad 
mutants as well as nprl-1 mutants have the same enhanced susceptibility phenotype with 
respect to P.s maculicola ES4326 as the eds mutants (Glazebrook et al.. Genetics 143:973- 

20 982, 1996), the fifteen eds mutant* were tested to determine whether they synthesized 
wild-type levels of camalexin in response to infection by P.s. maculicola ES4326 (pad 
phenotype) and whether PR I gene expression can be induced by salicylic acid (nprl-1 
phenotype). The results of these analyses showed that two of the eds mutants exhibited an 
nprl-Vikc phenotype. Genetic complementation analysis showed that these two mutations are 

25 allelic to nprl-1 . These two mutants were re-named nprl-2 and nprl-3. 

Map-Based Positional Cloning of the Arabidopsis NPR1 Gene 

To map the NPR1 gene, a genetic cross was made between the nprl-1 mutant (present 
in the Columbia ecotype (Co! -CM which earned the RGLJ-GL/S reporter gene) and the wild- 
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tvpe (present in Landsberg erccta ecotype (La-tv) which carried the BGL2 '-(/' \S reporter 
gene) F3 families from this cross that are homozygous for this mutation at the NPRl locus 
were identified by their lack of expression of BGL2-GUS when grown on plates containing 
0.1 mM INA. Expression of the GUS reporter gene was detected by a chromographic assay 
5 of GUS activity using the substrate 5-bromo-4-chloro-3-indolyl glucuronide according to 

standard techniques (Cao et al.. Plant Cell 6: 1 583- 1 592, 1994 and Jefferson Plant MoL Biol. 
Reporter 5:387-405, 1987). The leaf tissues of these F3 nprl-1 progeny pools (from thirty to 
fortv two-week-old seedlings) were collected and frozen m liquid nitrogen. From the frozen 
tissues, genomic DNA preparations were made as described by Dellaporta et al. (Plant Mol 

10 Rial Reporter 1:10-21, 1983) and used to determine the genotypes of various restriction 
fragment length polymorphism (RFLP) and codominant amplified polymorphic sequence 
(CAPS ) (Konieczny and Ausubel, Plant J 4:403-410, 1993 ) markers. The frequencies of 
recombination between the NPRl locus and the RFLP and CAPS markers were used to 
determine the position of the NPRl gene according to conventional methods. 

15 As shown in Fig. 1, the NPRl gene was mapped to Arahidopsis chromosome I, and 

found to reside between the CAPS marker GAP-B (-22.70 cM on the centromenc side of the 
NPRl gene) and the RFLP marker m3 15 (-7.58 cM on the telomenc side of the NPRl gene) 

To carry out fine mapping of the NPRl gene, new CAPS and RFLP markers were 
generated from clones that the genetic maps in the AtDB database (http: genomc- 

20 www. Stanford. edu/Arabidopsis/) showed were located between GAP-B and mil 5. Cosmid 
g4026 (CD2-28, Arabidopsis Biological Resource Center, The Ohio State University, 
Columbus, OH) was cut with the restriction enzyme EcoRl and a 4-kb fragment was used to 
identify a polymorphism between Col-0 and La-rr after the genomic DNA was digested with 
Hindlll. L ; smg this RFLP marker, six heterozygotes were detected among the twenty-three 

25 F3 families that were heterozygous at GAP-B. None were found among the seven F3 families 
that were heterozygous at m3l 5. Therefore, g4026 is 5.92 cm on the centromenc side of the 
NPRl gene. Cosmid gl 144? (obtained from the collection of Dr. Howard Goodman at the 
Massachusetts General Hospital (Nam et al., Plant Cell 1 :690-705, 1989)) was used to 
generate a CAPS marker End-sequences of an 0.8-kb PeoR\ fragment were used to design 
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PCR primers ipnmer I: 5' GTGACAGACTTGCTCCTACTG ?' (SEQ ID NO. 15); primer 2: 
5' CAGTGTG TATCAAAGCACCA 3* (SEQ ID NO: 16) which amplified a fragment 
displaying a polymorphism when digested with the EcoR\' restriction enzyme. Among the 
436 nprl-I F3 progeny tested using this newly generated CAPS marker, seventeen 
5 heterozvgoies were discovered. Since these heterozygotes were ali homozygous Col-0 for the 
GAP-B locus, the gJ 144 ~ marker was placed ~ 1.95 cM on the telomenc side of the NPR1 
gene. 

There are a number of RFLP markers mapped between gl 1447 and g4026. The first 
marker tested was m305 (designated CD1-1 1, Arahidopsis Biological Resource Center, the 

lu Ohio State University, Columbus, OH (Chang et al., Proc Sal! Acad Set . USA 85:6856- 
6860, 1988)). A 5-kb EcoRl fragment isolated from the m305 lambda clone was further 
subcioned using Sa[\:Xba\ and the end-sequences of a 1 .6-kb fragment were used to design 
PCR primers ipnmer 1 : 5' TTCTCCAG ACCACATGATTAT 3'(SEQ ID NO: 17); pnmer 2: 
5' TGAAGCTAATATGCACAGGAG 3' (SEQ ID NO: 18)). The resulting PCR fragment 

1 5 amplified using these primers was digested with Haelll to detect a polymorphism. Among 
the 305 npri-l progeny examined using this m305 CAPS marker, no heterozygotes were 
found, indicating that the m305 marker lies extremely close to NPR1 . 

A partial physical map of chromosome I 

20 (http: v cb:! humgen upenn edu/-atgc/ATGCUP.html) showed a YAC contig that includes 
m305. The YACs in this contig, as well as left-end-fragments of YAC clones yUPl 9H6 t 
\UP21 A4, and yUPHH9 were obtained from Dr. Joseph Ecker at the University of 
Pennsylvania The vUP19H6L end-probe was found to detect an Rsa\ polymorphism, and 
five recombinants were identified among the GAP B recombinants on the centromenc side of 

25 the NPR1 gene (as shown by the vertical arrows in Fig. 1 ) The yUPl 1H9L end-probe was 
found to detect a HindlU polymorphism, and one heterozygote was found among the 
seventeen recombinants for gll44 7 on the telomenc side of the SPR1 gene (as shown by a 
vertical arrow in Fig. I). Since yUPl 1H9L hybridized with the \ UP19H6 YAC clone, these 
results showed that the \PRI gene is located on vl'PM'HO. In addition to m30\ >UP21 A4L 
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(detects an EcoRl polymorphism) and gS020 (a 1 3-kb EcoRl fragment that detects a Hind\\\ 
polymorphism) were found to be very closely linked to the SPR1 gene with no recombinants 
identified. m305. vUP2 1 A4L, and g8G20 all hybridized to the yLT19H6 VAC clone, further 
supporting the conclusion that yUP19H6 contains the NPRI gene. 
5 Construction of a Cosmid Librae from the VAC Clone vt'PI9H6 

A genomic DNA preparation was made from the yeast strain containing the VAC 
clone yL'PI9H6. This DNA was partially digested with the restriction enzyme TaqL size 
selected on a l'J-40°o sucrose gradient, and cloned into the Cla\ site of the binary vector. 
pCLD04541 (obtained from Dr Jonathan Jones (Bent et al.. Science 265: 1 856-1 860, 1994)). 
In The pCLDC)4541 vector is a standard transformation vector used for preparing cosmid 

libraries. This plasmid carries a T-DNA polyhnker region, and tetracycline and kanamycm 
resistance markers. 

The cosmid clones were packaged into bacteriophage lambda particles using a 
commercial packaging extract (Gigapack XL, Stratagene, LaJolla, CA) and introduced into E 

1 5 coli strain DH5a according to the instructions of the supplier. The resulting library was 
found to contain approximately 40,000 independent clones. 
Generation of a Cosmid Co ntig Containing the NPRI Gene 

The cosmid library generated from the yeast strain containing yl_JP19H6 was plated 
( 1 .500 cfu plate) on LB medium agar (containing 5 ug^mL of tetracycline to select for the 

20 presence of pCLD04541 ) and incubated at 37^C overnight. Colonies were lifted onto 

membranes (GeneScreem Du Pont, New England Nuclear! and hybridization was earned out 
according to the protocol described by the manufacturer. The library was probed with 5-kb 
EcoRl, 6 5-kb EcoRl Xhol, and a 1 3-kb EcoRl fragments prepared from m36L\ yUP2 1 A4L, 
and gS020, respectively. The colonies that hybridized with these probes were identified and 

25 purified according to conventional methods. Cosmid DNA preparations were made from 

these positive clones using the alkaline lysis method described by Sambrook et al. (Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York. 1989). and 
the inserts were analyzed by HindlU restriction digestion and Southern hybridization using 
the probes stated above. The cosmids were found to form a single cosmid contig spanning 
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approximate! v SO-kb of Arabutopsis DNA Three of the five recombinants for yUP19HL 
were shown to be heterozygous at an RFL? marker detected by cosmid clone m305-3-\ (a 
5-kb Hind\\\ fragment) at the centromenc side of the contig, while the single heterozygotc 
detected by gSOJO marker was also detected by the cosmid clone g8020-6-3 (a 1 .25-kb 
5 HmdlU fragment) at the telomenc side of the contig. This showed thai the cosmid contig 
contained the SPR1 gene (Fig. 1 ). From this contig, fourteen cosmids which each have a 
minimum of 10-kb overlap with the neighboring clones (Fig. 1 ) were chosen to transform 
nprl mutant plants in complementation experiments. 
Complementation of the nprl Mutations 

j !; The cosmid clones contained in the E. < oli strain DH5a were transferred into the 

Agrohacterium tumefaciens strain GV3101 (pMP90) (Koncz and Schelh Moi Gen. Genet 
204:383-396. 1986) by conjugation using the helper strain MM294A (pRK2013) (Finan et 
al., J- Bacterial 167:66-72, 1986) The resulting A. tumefaciens conjugants were selected 
using 50 ng/'mL kanamycin and 50 jig/mL gentamycin. The A. tumefaciens strains carrying 

1 5 those fourteen cosmid clones were transformed into nprl -I (Cao et ah. Plant Cell 

6:1583-1592, 1994) and nprI-2 (Glazebrook et al , Genetics 143:973-982. 1996) using a 
vacuum infiltration method described by Bechtold et al. (CR. Acad. Set. Pans. Life Sciences 
316:1194-1199, 1993). The integrity of the cosmid clones in the A. tumefaciens cultures used 
for transformation were examined by Southern analysis. 

20 Transformants of nprl -2 were grown (22 °C in fourteen hours of light) and selected on 

MS medium agar (Murashige and Skoog, Physiol Plant. 15:473-497, 1962) containing 2% 
sucrose, 50 |ig/mL kanamycin, and 100 ng/mL ampicillin. Kanamycin-resistant 
transformants which developed true leaves and healthy roots were transplanted to soil. After 
two weeks of growth in soil at 22 "C m tourteen hours of light per day, 'leaves were collected 

25 from three transformants of each cosmid clone and soaked in 0.5 mM INA solution for 

twenty-four hours at 22 - C in fourteen hours of light per day. Leaf tissues were then collected 
and frozen in liquid nitrogen. Total RNA w as extracted from these leaf tissues, and an RNA 
blot was prepared as described by Cao et al. (Plant Cell 6: 1 583- 1 592, 1994). The blot was 
probed with a f# /-specific probe (a PGR product obtained by amplifying genomic 
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Arabuiopsis DNA with /^/-specific primers (sense primer 5' 
GTAGGTGCTCTTGTTCTTCCC3' (SEQ ID NO:! 9); anti-sense pnmer 
5'CACATAATTCCCACGAGGATC3' (SEQ ID NO:20)). 

In control experiments, the wild-type parental line showed the induction of the PR I 
5 eene bv FN A, while the nprl-2 mutant exhibited no induction of PR- J gene expression. 

SprI-2 transformants containing cosmids (three for each cosmid) 21A4-6-1-1, 21 A4-P5-1, 
21 A4-4-3-1, and 21A4-2-1 showed strong induction of PR I by FNA, while nprl-2 
transformants containing other clones (for example. M305-2-3, M305-3-9, and 2 1 A4-3- 1 > 
displaved no induction. Variations were observed in the intensity of RNA bands among three 

10 individual transformants sampled for each cosmid clone. These variations were likely to be 
the result of "position-effects." the effect of the insertion site in the chromosome on the 
expression ot the transgene. Cosmid clones 21 A4-4-3-1, 21 A4-6-1-1, 21 A4-P5-1, and 
21 A4-2-1 restored the ability of the nprl-2 mutant to respond to IN A induction and, 
therefore, complemented the nprl-2 mutation. Examples of INA induced PR1 are shown in 

15 Fig. 2 A. 

Transformants carrying each cosmid were also tested for SA induction of PR I 
expression by RNA blot analysis Examples of SA induction are shown in Figure 2A. The 
wild-type parental line exhibited a high level of PR] gene induction by SA, whereas the 
nprl-2 mutant exhibited only a minor induction (Fig. 2A) Transformants of the nprl-2 
20 mutant containing cosmids 21 A4-6-M, 21 A4-P5-1, 21 A4-4-3-1, and 21 A4-2-1 showed 

induction of PR1 by SA, while those containing the other clones displayed little induction. 

As shown in Fig. 1 , these four clones share a common region of 7.5-kb. 
Transformants of cosmid 21A4-P4-1 were not available when the experiment described 
above was conducted. However, according to its relative position, it is expected that this 
25 clone can also complement the nprl-2 mutation. 

The same fourteen cosmid clones were also transformed into the nprl-1 mutant. 
Since the nprl-1 mutant carries the BGL2-GUS reporter and the kanamycin resistance gene 
(NPTII), transformants of the cosmid clones could not be selected using kanamycin. Instead, 
transformants that complemented the nprl~i mutation were selected directly by growing the 
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seeds collected from the nprl-i plants infiltrated with A. uwictacicns on a high concentration 
of SA (0.5 mM) Those plants that developed green leaves were transplanted to another plate 
containing o. 1 mM FN A. and GL'S activity was measured one week after transplanting. 

To measure GUS activitv seedlings were numbered, and a single leaf was removed 
5 from each plant and placed in a microliter well containing 100 uL of GUS substrate 

(4-methylumbelhferyl p-glucuronide) in a solution as described previously (Cao et al.. Plant 
Cell 6:1583-1592, 1 994; Jefferson, Plant Mol. Biol. Reporter 5:387-405, 1987). After an 
overnight incubation at 37°C, the fluorescent product of GUS activity was examined under a 
lon^ wavelength UV light. As controls, twelve seedlings of the wild-type parental line 

in f RCU-2-GL'S) were tested, and ail showed intense fluorescence after growth on SA and IN A. 
Tw elve seedlings of the nprl-i mutant [BGL2-GUS) were also included in the experiment, 
and none displayed any increase in fluorescence From this experiment, nine seedlings 
carrying cosmid 21 A4-P4-1, five carrying 21 A4-P5-1, and six carrying 21 A4-2-I were found 
to have high levels of fluorescence, i.e., GUS activity, and none of the seedlings from other 

15 cosmid clones were identified through this selection. Direct identification of putative 

complementing transformants in the nprl-l mutant plants by the cosmid clones 21A4-P4-1, 
2 1 A4-P5- 1 , and 2 1 A4-2-1 as in the transformation experiment using the allelic nprl-2 mutant 
(where all transformants were first selected by kanamycin resistance before identification of 
the transformants that could complement the nprl-2 mutation using RNA blot analysis) 

Ik) further supported the conclusion from complementation experiments with nprl-2 that the 7.5 
kb region shared by cosmids 2 1 A4-4-3- 1 , 2 1 A4-6- 1 - 1 , 2 1 A4-P5- 1, 2 1 A4-P4- 1 , and 21 A4-2-1 
complemented nprl mutations, and that this 7.5-kb region contained the NPR1 gene. 

In addition to reduced PR gene expression, plants with nprl mutations display 
susceptibility to virulent pathogens even after SAR induction. These mutant phenotvpes 

25 were also complemented by the cosmids desenbed above For example, as shown in Figure 
2B, infection by the bacterial pathogen Psm ES4326 caused visible disease symptoms three 
days after infection. While the disease symptoms in the wild-type plants and the 
complemented nprl-l transformants were well-confined to the site of pathogen infiltration 
( the left SK j c of the leaf), the lesions in the nprl-l plants were found to spread beyond the site 

?n of infiltration. In addition, when the dosage of infecting bacteria was reduced 10-fold, severe 
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disease svmptoms were only observed in the nprl-l mutant {leaves on the right) This 
experiment showed that 21 A4-4-3-I complemented the enhanced susceptibility to Psm 
ES4326 displayed by nprl-l . 

The expression of the BGL2-GUS gene was also analyzed in the same leaves after 
5 examination of the disease symptoms (Fig. 2B). Strong GCS expression (blue staining) was 
detected in the marginal regions of the well-confined lesions in the wild-type plants, but was 
absent from the diffuse lesions in the nprl-l plants. Reporter gene expression was restored in 
complemented transformants. 

In addition to these visual observations, as shown m Fig. 2C. bacterial growth of Psm 

10 ES4320 was measured quantitatively in wild-type, nprl-2. and 'dnnprl-2 transformant with a 
complementing cosmid (21 A4-P5-1 ). Plants were treated with 0.65 mM 1~NA seventy-two 
hours prior to Psm ES4326 infection (0D b0 n = 0.001 ). Infection of Arabidopsis with Psm 
ES4326 was performed according to standard methods (Bowling et aL 1994; supra. Cao ct 
al., supra, 1994; Glazebrook et aL supra, 1996). Samples were taken before infection and 

1 5 one, two, and three davs after infection. Six to eight samples were taken for each time point 
analyzed and colony-forming units of Psm ES4326 were determined per leaf disc. Complete 
inhibition of Psm ES4326 growth was observed in the wild-type plants following IN A 
treatment three days pnor to infection, whereas an approximate 10-fold decrease in Psm 
ES4326 growth was observed in the nprl-2 mutant subjected to the same treatment. The 

20 growth of Psm ES4326 was also halted in the complemented transformants after IN A 

treatment. Lower bacterial growth (as great at 10 l -fold) was observed even in the water- 
treated transformants compared to the water-treated wild-type (Fig. 2C) and the water-treated 
transformants carrying noncomplementing cosmids. This enhanced resistance may result 
from the increased NPR1 mRNA levels in these complemented transformants 

25 A test of resistance to a fungal pathogen. P parasitica NOCO, was also performed to 

venfv complementation of the nprl-l mutation Infection of Arabidopsis with P. parasitica 
NOCO was performed according to standard methods ( Bowling et al.. supra, 1 994; Cao et 
ah, supra. 1994; Glazebrook et al., supra. 1996). IN A treatment {0.65 mM) was earned out 
seventy-two hours prior to infection with a spore suspension (3 x 1(T spores/ 1 mL). Seven 

30 days post-infection, the disease symptoms were scored with respect to the number of 



WO 98/06748 



POYLS97/13994 



MmJ ,op„ores observed on each plan, A ,o,a, oitwemv io ,wem>-,n e plants were examined 
,or each eenotvpe w,th each treaimem. Data were analvzed using the Mann-^.tney U-Tes, 
,Soka, and Rohlf. supra, As shown ■„ F,g. 2D. ,h= result of Ihese experiments ,nd,c at ed 
ina, INA-mduced resistance to P. paras„:ca NOCO was restored ,n the transforms w„h 
the complementing cosmids. 

Analyses of the 7 S-kb Rerm n T ontninipp the VP/?/ Gene 

The 7 5-kb reston identified by the cosmui complementation expenment was further 
apalv .ed us.n, restriction enzymes. The resulting restriction map from th,s analysis is shown 
in Fiu ^ Three sets of subclones were made us.ng H^U. Alal, and ClaMXhoX digestions 
^ cosmid21A4-P5-l.wh.ch has the 7.5-kb region located ,n the center of the insert, and 
Ugated ,nto the vector pBluescript II SK" (Stratagene. La Jolla. CA). The 7.5-kb region ot 
.merest was represented by five Hindlli subclones with the approximate insert s.zes 1 .96-kb. 
191 kb 174-kb 1.25-kb.and0.50-kb. Subclones with larger inserts (Xbal: -8. 5-kb. -8.5- 
kb. - 1 45-kb: ClaMXhol. - 10.0-kb. and -5.1-kb) were also made to orient and connect these 
HindlU fragments. 

A Southern blot containing .he /Mil-digested genomic DNA samples from the 
w,ld-type parenta. line (BGL2-CUS) and the three n P rl mutants was examined with probes 
derated tamWmflll fragments made from the cosmid clone 2, A4-P5-1. No significant 
"deference in the restneon patterns was observed between the w.d-.ype and all three npr, 

., .. .,,„.„ T ,„. fo „ ...s.ml.kelv that these mutants earned a substantial deletion in 

allelic mutants ii^"" J '-'-'- 

the S'PRl gene. 

DN A fragments covenng the 7.5-kb region were used to delect transcripts on a blot 
con.a.n.ng the polyA mRNAs made from four-week-old plants of the wild-ype parental line 

, - l . ot -~. v,~., r c „t>*r treatment of the plants with H ; 0 
and of the three npr I allelic mutants ^veiav-.w. „o-r. 

or 0 65 mM IX A and 2 mM S A. The polyA mRNA samples were prepared using Dynabeads 
(Dvnal Inc.. Lake Success. NY) from seventy-five micrograms of total RNA according to the 
protocol provided by Dynal. From this analysis, only one -2.0-kb mRNA was detected ,n the 
- S-kb reuion us.ng probes made from the 0.5-kb and the adjacent 1 . % -kb .Will fragments. 
Th,s mRNA represented a putat.ve transenpt of the NPR ! gene In addmon. the intensity of 
thls transcn pt was about two-fold higher ,n the INA SA-induced samples compared to the 
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H O-treated controls as measured by a Phosphorlmager and ImageQuant {Molecular 
Dynamics, Sunnyvale. CA). Thus, the expression of this transcript believed to represent 
mRNA of the NPR1 gene was induced by IN AS A treatment. No significant difference in the 
pattern of expression was discovered between the wild-type and three nprl mutant alleles on 
this polyA RNA blot. 
Se quence Analysis o f the XPR I Gene 

The initial sequencing analysis was earned out using pBluescnpt SK clones of the 
five Hind\\\ fragments as templates. The template DNA samples were prepared using Qiagen 
Plasmid Mini Kits (Qiagen Inc., Chatsworth. CA), and 0.6 ug of the template was used (or 
each sequencing reaction and analyzed by an ABI automated sequencer 

M 1 3-20 and M 1 3 reverse primers were used to initiate the sequencing reactions of the 
HincJlll fragments. Various restriction enzymes were then used to generate deletions in these 
HindlU subclones to analyze sequences more distal to the ends of the fragments. In addition, 
primers were designed to perform primer walking. The relative positions of these HindlU 
fragments were determined and gaps between these fragments were filled by sequencing 
analyses using A7?aI-subclones of cosmid 21 A4-P5-1 as templates. The sequence data were 
analyzed to identify restriction enzyme sites, to perform sequence alignment and to search for 
open reading frames using standard DNA analysis software (DNA Strider 1.1. Mac Vector 
4.0. 1 , and GeneFinder). Using this software only one putative gene was found Sequence 
data were also compared to the TIGR Arahidopsis thaliana DataBase 
(http://www. tigr.org/tdb/atyat. html) The results of this study identified an expression 
sequence tagged (EST) clone that showed homology with a portion of the 1.96-kb fragment. 
This portion of the 1.96-kb fragment was also identified as part of the gene recognized using 
GeneFinder software The nucleotide sequence of the \5-kb genomic region encoding the 
NPR1 gene product is shown in 
Fig. 4. 

Isolation of XPR I cDNA Clones 

A cDNA library that was constructed by Dr. Katagin (and described in detail in 
Mindnnos et ah, Cell 78 : 1 089- 1 099, 1 994) was screened using the 1 .96-kb HindlU fragment 
as a probe. Bacterial cells (E coli DH10B; GIBCO BRL. Gaithersburg, MD) containing 
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cDNAs made from the aenal pans of one-month old wild-type Arabuiopsis plants in vector 
pK£\4tr were plated (00,000 cfu plate) on LB medium containing 1 00 ug mL arnpicillm. and 
the plates w ere incubated at 37 = C for four and one-half hours. Colonics were lifted onto 
Colony Plaque Screen membranes (NEN Research Product; Boston, MA), and then the 
membranes were placed onto an LB plate, with the colony side up. Both plates were 
incubated at 30 C for twelve hours. The membranes were autoclaved for one minute to lyse 
the cells and fix the DNA to the membrane. Hybridization was performed at 42 °C in a 
solution containing 10% dextran sulfate. 50% formamide, 6X SSC, 5X Denhardt's, and 1% 
SDS; and the membranes were washed twice at 65 r C in 2X SSC and 1% SDS. The positive 
colonies were purified through secondary and tertiary screens using identical conditions. One 
positive cloned was subsequently identified and designated pKExNPRl 

The cDNA inserts were excised from the vector using restriction enzymes EcoRl and 
Sad. Southern analysis was performed using probes made from the 1.96-kb (the 3'-end of 
the open reading frame) and the 0.5-kb (the 5'-end ot the open reading frame) HindlU 
fragments to confirm homology of the cDNA clones. The nucleic acid sequence (SEQ ID 
NO:2) and deduced ammo acid sequence (SEQ ID NO:3> of the acquired resistance protein 
termed NPR1 from Arabidopsis thahana encoded by the 2.1-kb cDNA is shown in Fig. 5. 
Sequence analysis revealed that this cDNA contained sequences corresponding to those 
identified in the EST clone and deduced using the Gene Finder software. 

The cDNA sequence was analyzed using the BLAST sequence analysis program. 
This analvsis revealed that the NPR1 protein shared significant homology with ankyrin, 
including the region identified as the ankynn-repeat consensus. In particular, as shown in 
Fie. 6A, the NPRi sequence contains two regions with significant homology to the 
mammalian ankvrin 3 eene. The sequence identities between SPRJ (amino acids 121-171 
and 262-289) and ANK3 (amino acids 740-788 and 313-340) are 42% and 35%, respectively, 
and the sequence similarities are 59% and 57%, respectively. This ankynn-repeat consensus 
has been identified in a diverse array of proteins including transcription factors, cell 
differentiation molecules, structural proteins, and proteins with enzymatic and toxic 
net: \; i ics This motif has been shown to function hy mediating protein interactions. 

I sini: the consensus sequence defined by Michaely and Bennett (Trends in Cell 
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Bioloirx 2: 12"- 129, 1 ^92 ) and Bork (Proteins: Structure, Function, and Genetics P:36?-3 7 4. 
1993). T'-v :ditional ankynn repeats were identified m SPR1\ these are shown in Fig. 6B 

In addition, using the MacVector program, a 1 7 ammo acid motif of G-protein 
coupled receptors (MKGTCEFI VTSLEPDRL. Fig. 5, SEQ ID NO 21 ) has been found in the 
5 NPR1 protein [Science 244:569-572, 1989) 

The NPR 1 -determined Resistance is Dosage Dependent 

The ability of NTR- 1 to confer disease resistance was evaluated in transgenic plants as 
follows. The NTRi cDNA sequence (Fig. 5. SEQ ID NO:2) driven by the constitutive 

lu CaMY 35S promoter was transformed into Arahulopsis ecotypc Columbia according to 
standard methods. In the resulting T, lines homozygous for the 35S-NPR 1 transgenc, the 
expression of the NPR 1 -regulated PR-1 gene, MPR1 mRNA, and NPR1 protein were 
measured to identify those lines exhibiting high (Col NTRI H ). medium (ColNPRIM), and 
low (Col NPR 1 L) levels of NPR 1 expression. Table 1 shows the results of evaluating the 

15 relative levels of PR-L NPRI mRNA, and NPR 1 protein concentrations. 
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Table 1 

C haracterization of 35S-NPR1 T ransgenic Lines 

PR-] NTR1 NPRl 

Genotype t IN A)" < mRNA) 1 i.Protcinr 

Col 1-00 1-00 1.00 

Col-Ll 0-41 6.92 0.04 

CoFL2 0.54 0.90 <0.04 



Col-Ml 



9.20 1-40 



Coi-M2 1.80 0.50 140 

Col-Hl 2.60 17.80 1.60 



Col-H2 2.^4 27.90 3.00 

20 

J The relative levels of PR-! were measured by an RNA blot analysis in the 35S-NPR1 transgenic lines grown 
on plates containing 0 1 mM INA 

p The relative levels of NPR1 mRJMA were measured by a polyA + RNA blot. 
25 c The relative NPR1 protein concentrations were measured by ELISA using NPRl polyclonal antibodies. 



From these experiments, two lines of transformants were identified that had 
bi-mficantiv lower NPRl protein levels (but not mRNA levels) than the wild-type parent. 
This*, however, was not unexpected because overexpression of a transgene in plants often 
leads to co-suppression of the transgene as well as the corresponding endogenous gene 
(Baulcombe, The Plant CclL 8:1833, 1996}. 

The high-, medium-, and low expressing 35S-NPR; transgenic lines were next 
subjected to infection by the bacterial pathogen Pscudomonas svnnigae pv macuhcoia 
ES4326 and the fungal pathogen Peronospora parasitica NOC02 according to standard 
methods. The results of these experiments are shown in Figs 8 A and 8B, respectively. In 
the absence of SAR induction, the high- and the medium-expressing 35S-NPR1 transgenic 
lines showed significantly increased resistance to both bacterial and fungal pathogens while 
the low-expressing transgenic lines displayed reduced tolerance to the pathogens as compared 
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to the wild-type. Together, these results showed that NPR1 was a positive regulator of SAR, 
and that the NPR 1 -determined resistance w as dosage dependent, o\ erexpression of the NPR 1 
protein enhanced resistance whereas underexpression led to reduced tolerance to infection. 
N'PRl is Translocated to the Nucleus L'pon SA Induction 
5 To elucidate the induction mechanism and the molecular function of the protein, the 

subcellular localization of NPR 1 was determined by using standard reporter gene fusion 
construct analysis The green fluorescent protein (GFP) gene was fused to the carboxyl end 
of the NPR1 cDNA driven by the constitutive CaMV 3>S promoter, and the 35S-NPR 1 -GFP 
construct was used to transform nprl mutants, nprl-l and nprl- 2, according to standard 

K> methods In the resulting transgenic lines, the NPR 1 -GFP transgene was found to 

complement all the nprl mutant phenotypes; namely, the lack of SA- or INA-induced PR 
aene expression, the reduced tolerance to exogenous SA, and the lack of SA- or INA- 
induced resistance to pathogens (Figs. 9A-9C). Transgenic lines expressing the GFP alone 
(designated 35S-mGFP), exhibited no complementing activity (Fig 9B). In addition, the 

1 5 presence of the NPR-GFP transgene w as found to restore both inducible BGL-GUS 

expression and resistance to P parasitica as shown in Figs. 9A and 9C. respectively. These 
experiments therefore showed that the NPR 1 -GFP was biologically active and that the 
subcellular localization of NPR 1 -GFP should reflect that of the endogenous NTRi protein. 

To examine the subcellular localization of the NPR1 protein, the 35S-NPR1-GFP and 

20 35S-mGFP transgenic lines were grown in MS medium in the presence or absence of the 

SAR-inducing chemicals SA or INA. Eleven-day-old seedlings were subsequently examined 
using confocal microscopy to detect localization of NPR1 -GFP and mGFP. As shown in Fig. 
10, the 35S-NPR1-GFP seedlings grown on MS showed low levels of GFP throughout the 
mesophvll cells and strong GFP fluorescence in the nuclei of the guard cells Upon induction 

25 by SA or INA, NPR 1 -GFP was detected exclusively in the nuclei of both the mesophvll cells 
and the guard cells. In the 35S-mGFP transformants, green fluorescence was detected in the 
cytoplasm as well as in the nuclei, and SA and INA treatments had no effect on the 
localization of the protein. These results indicated tha: NPR: was localized in the cytoplasm 
in the mesophyll cells, and that upon induction the NPR1 protein was transported into the 

30 nucleus resulting in PR1 gene expression and resistance In the guard cells, the NPR1 protein 
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was localized in the nuclei even without an SAR induction, an intriguing observation because 
constitutive activation of defense mechanisms in these cells may be necessary to fend off 
microbial pathogens from gaming entry into the plant through stomata. Since mGFP alone 
showed no induced nuclear translocation, the nuclear transport of the NTRl-GFP fusion must 
5 he directed hv a signal in NPR1 Consistent with this, the following two potential nuclear 
localization sequences (NLSV) were found in NPR1 : 

252 RRKELGLEVPKVKK 265 (SEQ ID NO:22); and 

541 KKQRYMEIQETLKK 554 (SEQ ID NO:23). 

Significantly, nuclear translocation in tissues infected by the virulent pathogen Psm 
10 ES4326 was also observed (Fig. 1 1 A). This pattern of induction was also observed to 
coincide with the pattern of PR gene expression observ ed in plants after infection 
(Fig. 1 IB). 

Characterization of npr Mutations 

1 o further characterize the NPRl gene, the mutations in npr!-L npr 1-2, npr 1-3, and 

1 5 npr J -4 were identified by DNA sequencing. The mutant npr I -4 is a new npr] allele that was 
identified in the Col-0 (BGL2-GUS) background based on its enhanced susceptibility to Psm 
ES4326. Each mutant allele was found to contain a single base-pair change. The nprl-L 
nprl-2 y npr 7-3, and nprl-4 alleles respectively altered the highly conserved histidine (residue 
334) in the third ankynn-repeat consensus to a tyrosine, changed a cysteine (residue 150) to a 

20 tyrosine, introduced a nonsense codon (residue 400) that should result in a truncated protein 
lacking 194 amino acids of the C-terminal end of the protein, and destroyed the acceptor site 
of the third intron junction. All of these point mutations are GC to AT transitions, consistent 
with the mode of action of the mutagen, ethyl-mcthanesulfonate (EMS), used for the 
generation of the^e mutations. 

25 Genetic Analysis of the Plant Defense Response Usi ng Arabidoosis thaliana 

Although biochemical studies have played an important role in elucidating the general 
features of the plant defense response, the complexity of the defense response limits the 
utility of biochemical analysis in determining the importance of particular defense responses 
or enzvmes in conferring resistance to pathogens. Isolation of plant defense-response 

3n mutants not only helps elucidate the roles of known pathogen-induced responses in 
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combating particular pathogens, but also facilitates the identification of plant defense 
mechanisms not already correlated with a known biochemical or molecular genetic response 
With the development of well-characterized hostpathogen systems involving the model plant 
Arabidopsis thaliana as the host as described herein, comprehensive genetic analysis of 
5 acquired resistance responses is made possible. 

All of the major features of the plan: defense response that have been observed in crop 
plants have also been observed in Arabidopsis-palhogen interactions. For example, several 
resistance genc-avr gene interactions have been identified for both bacterial and fungal 
pathogens of Arabidopsis (Bisgrove et al , Plant Cell 6:927-933. 1994; Holub et al., Mol 

10 Plum-Microbe Interact. 7:223-239. 1994; Kunkel ct al.. Plant Cell 5:865-875, 1993; Yu et 
al., Mol Plant-Microbe Interact 6:434-443. 1993). Moreover, all of the important features 
of SAR have been observed in Arabidopsis (Uknes et ah. Plant Cell 4:645-656, 1992; Uknes 
et at., Mol. Plant-Microbe Interact 6:692-69S, 1993) Importantly, the power of Arabidopsis 
genetic analysis has recently been used to help identify a variety of components of the 

15 Arabidopsis defense response to pathogen attack (Bent et al., Science 265 : 1 856- 1 860, 1994; 
Bowling et al.. Plant Cell 6: 1 845-1 857, 1994; Cao et al.. Plant Cell 6:1583-1592, 1994; 
Century et al., Proc Natl Acad Set USA 92:6597-6601, 1995; Delaney et al.. Proc. Natl. 
Acad. Sci. USA 92:6602-6606, 1995; Dietrich et al., Celt 7~:565-577, 1994; Glazebrook and 
Ausubel, Proc. Natl Acad Sci. USA 91.895 5-8959, 1994; Glazebrook et al.. Genetics 

20 143:973-982, 1996; Grant et ah, Science 269:843-846, 1995, Greenberg and Ausubcl, Plant 
J. 4:327-341, 1993; Greenberg et al., Plant J. 4:327-341, 1994; Mindnnos et al.. Cell 
78:1089-1099, 1994). Thus, the results described herein provide the basis for identifying 
genes that are involved in acquired disease resistance throughout the plant kingdom and are 
not limited to Arabidopsis. 

2 5 isolation of Solanaceous AR Genes 

Using the Arabidopsis NPRI cDNA sequence shown in Fig. 5 (SHQ ID NO. 2), the 
isolation of .AR homologs that are found in solanaceous plants (e.g., potato, eggplant, tomato, 
tobacco, petunia, and pepper) is readily accomplished using standard techniques. 

For example, a Nicotiana glutinosa cDNA library was screened for the presence of an 

30 NPRI homolog. The library was constructed in the lambda ZAP II vector from poly 
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( A^)RNA isolated from Sicotiana gluiinosa plants infected with tobacco mosaic virus 
(7MY) (Whitham ct al.. Cell "8: 1101-11 15, 1994) Bacteriophage were plated on N T ZY 
media using XL- 1 Blue host cells. Approximately 10° plaques were screened by transferring 
the phage DNA onto positively charged nylon membrane {GeneScreen; DuPont-New 
England Nuclear) and probing with a random primed 3: P labeled probe that was prepared 
using the full-length Arabidopsis SPRi cDNA as the template. Hybridization was performed 
at 37 X in 40° o formamide, 5X SSC 5X Denhardt, 1% SDS, and 10% dextran sulfate. The 
filters were washed in 2X SSC for fifteen minutes at room temperature and 2X SSC 1% SDS 
for thirty minutes at 37 X\ 

Two hvbndizmg clones were identified and purified. The pBluescnpt plasmids were 
excised using XL- 1 Blue host cells and R408 helper phage. Restriction enzyme analysis 
indicated that the two positive clones contained inserts of approximately 3600 bp and 2100 
bp. Restriction digests and sequence analysis indicated that the 3600 bp insert represented 
two independent cUNAs of 2100 bp and 1 5UU bp and that the two independently isolated 
2100 bp cDNAs were identical. Both strands of the 2100 bp cDNA were sequenced using 
3 'S-dATP and the Sequenase sequencing kit (U.S. Biochemicals, Cleveland, OH). The 
nucleotide and amino acid sequences encoding the Nicotiana gluiinosa NPR1 homolog are 
shown in Fig. 7A (SEQ ID NO:13) and Fig. 7B (SEQ ID NO:14), respectively. 
Isolation of Other Acquired Resistance Genes 

Any plant cell can serve as the nucleic acid source for the molecular cloning of an AR 
gent; Isolation of an AR gene involves the isolation of those DNA sequences which encode a 
protein exhibiting AR-associated structures, properties, or activities, for example, an ankynn- 
repcat motif and the ability to induce gene expression of PR proteins that limit pathogen 
infection. Based on the AR genes and polypeptides described herein, the isolation of 
additional plant AR coding sequences is made possible using standard strategies and 
techniques that are well known in the art. 

In one particular example, the AR sequences described herein may be used, together 
with conventional screening methods of nucleic acid hybridization screening. Such 
ftvfrndi/ation techniques and screening procedures are well known to those skilled in the art 
and are described, for example, in Benton and Davis, Science 196 180, 1977, Grunstein and 
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Hoeness, Proc \u:l Acad. Sci . CSA ^2 39(^1, 19*^5: Ausubel et al. isup^a), Berger and 
Kimmcl i supra); and Sambrook et al.. Molecular Cloning: A Laboratoty Manual, Cold 
Spring Harbor Laboratory Press, New York. In one particular example, ali or part of the 
XPRI cDNA (described herein! may be used as a probe to screen a recombinant plant DNA 
library for genes having sequence identity to the AlR gene. Hybridizing sequences are 
detected by plaque or colony hybridization according to the methods described below. 

Alternatively, using all or a portion of the amino acid sequence of the AR 
polypeptide, one may readily design AR-specific oligonucleotide probes, including AR 
degenerate oligonucleotide probes (i.e.. a mixture of all possible coding sequences for a given 
amino acid sequence). These oligonucleotides may be based upon the sequence of either 
DNA strand and any appropriate portion of the AR sequence (Figs. 4 and 5, 7 A, and 7B SEQ 
IDNOS:1.2. 3. 13. and 1 4. respectively ) General methods for designing and preparing such 
probes are provided, for example, in Ausubel et al., 1996, Current Protocols in Molecular 
Biology, Wilev lnterscience, New York, and Berger and Kimmel, Guide to Molecular 
Cloning Techniques. 1987, Academic Press, New York These oligonucleotides are useful 
for AR gene isolation, either through their use as probes capable of hybridizing to .AR 
complementary sequences or as pnmers for various amplification techniques, for example, 
polymerase chain reaction (PCR ) cloning strategies If desired, a combination of different 
oligonucleotide probes may be used for the screening of a recombinant DNA library. The 
oligonucleotides may be detectably-Iabeied using methods known in the art and used to probe 
filter replicas from a recombinant DNA library Recombinant DNA libraries are prepared 
according to methods well known in the art, for example, as described in Ausubel et aL 
(supra), or they may be obtained from commercial sources 

In one particular example of this approach, related AR sequences having greater than 
8d° o identity are detected or isolated using high stringency conditions High stringency 
conditions may include hybridization at about 42 C and about 50% formamide, 0.1 mg'mL 
sheared salmon sperm DNA, 1% SDS, 2X SSC, 10% Dextran sulfate, a first wash at about 
65 L, C, about 2X SSC, and 1% SDS, followed by a second wash at about 65 C and about 0. IX 
SSC. Alternatively, high stringency conditions may include hvbndization at about 42 'C and 
about 50% formamide, 0.1 mg'mL sheared salmon sperm DNA, 0.5% SDS, 5X SSPE, IX 
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Denhardt's. followed by two washes at room temperature and 2X SSC. 0. 1 % SDS. and two 
washes at between 55-60 X and 0.2X SSC. 0.1% SDS 

In another approach, low stringency hybridization conditions for detect.ng AR genes 
having about 40% or greater sequence identity to the AR genes described herein include, for 
example, hybridization at about 42 C and 0.1 mg mL sheared salmon sperm DN'A. 1% SDS. 
2X SSC. and 10% Dextran sulfate (in the absence of form amide), and a wash at about 37~C 
and 6X SSC. about 1% SDS. Alternatively, the low stringency hybridization may be earned 
out at about 42 *C and 40% formamide. 0.1 mg.mL sheared salmon sperm DNA, 0.5% SDS. 
5X SSPE. IX Denhardt's. followed by two washes at room temperature and 2X SSC. 0.1% 
SDS and two washes at room temperature and 0.5X SSC. 0. 1% SDS. These stringency 
conditions are exemplary: other appropriate conditions may be determined by those skilled in 
the art. 

If desired, RNA gel blot analysis of total or poly(A-) RNAs isolated from any plant 
(e.e.. those crop plants described herein) may be used to determine the presence or absence of 
an AR transenpt using conventional methods. As an example, a Northern blot of potato RNA 
was prepared according to standard methods and probed with a 1.96-kb NPRl Hmd\\\ 
fragment in a hybridization solution containing 50% formamide, 5X SSC, 2.5X Denhardt's 
solution, and 300 ug,mL salmon sperm DNA at 37'C. Following overnight hybridization, 
the blot was washed two times for ten minutes each in a solution containing IX SSC, 0.2% 
20 SDS at 37 C. An autoradiogram of the blot demonstrated the presence an NPRl -hybridizing 
RNA in the potato RNA sample, indicating that this solanaceous crop plant encoded an 
acquired resistance gene. These results further indicate that AR genes are not restricted to the 
crucifer Arabidopsis. Isolation of this hybridizing transenpt is performed using standard 
cDNA cloning techniques. 
25 As discussed above, AR oligonucleotides may also be used as primers in 

amplification cloning strategies, for example, using PCR. PCR methods are well known in 
the art and are described, for example, in PCR Technology. Erhch, ed.. Stockton Press, 
London, 1989: PCR Protocols: A Guide to Methods and Applications. Innis et al., eds.. 
.Academic Press, inc.. New York. 1 990, and Ausuhel et a!. { supra) Primers are optionally 
3<> designed to allow cloning of the amplified product into a suitable vector, for example, by 
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inciudmg appropriate restriction sites at the 5 and ?' ends of the amplified fragment (as 
described herein). If desired, AR sequences may be isolated using the PCR "RACE" 
technique, or Rapid Amplification of cDNA Hnds (see, e.g., Inms et al. (supra)). By this 
method, oligonucleotide primers based on an AR sequence are oriented in the 3' and 5' 
directions and are used to generate overlapping PCR fragments. These overlapping 3'- and 
5'-end RACE products are combined to produce an intact full-length cDNA This method is 
described in Inms et al. \supra)\ and Frohman et ah, Proc Sail. Acad. Sci. USA 85:8998. 
19SS Exemplary oligonucleotide pnmers useful for amplifying AR gene sequences include, 
without limitation: 

A. AA( A/G)GA(/VG )GA(T C)C AflVC) A CN A A (SEQ ID NO:24); 

B TA(T/C)TG(T/C)AA(T/C)GTNAA(A/G)AC (SEQ ID NO:25); 

C GCCATNGTNGOT C)TG(T C)TT (SEQ ID NO:26); 

D AA( A/G)GTNAA(A G)AA( A/G)CA(C T)GT (SEQ ID NO:27); 

E (,V G)AA(CT)TC(A/G)CANGTNCC(C .'T1TTCAT {SEQ ID NO:28). 
For each of the above sequences, N is A. T, G or C. 

Alternatively, any plant cDNA or cDNA expression hbrary may be screened by 
functional complementation of an npr mutant (for example, the npri mutant described 
herein) according to standard methods described herein. 

Confirmation of a sequence's relatedness to the AR polypeptide family may be 
accomplished by a variety of conventional methods including, but not limited to, functional 
complementation assays and sequence comparison of the gene and its expressed product. In 
addition, the activity of the gene product may be evaluated according to any of the techniques 
described herein, for example, the functional or immunological properties of us encoded 
product. 

Once an AR sequence is identified, it is cloned according to standard methods and 
used for the construction of plant expression vectors as described below. 
AR Polypeptide Expression 

AR polypeptides may be expressed and produced by transformation of a suitable host 
cell with all or part of an .AR cDNA (for example, the cDNA described above} in a suitable 
expression vehicle or with a plasmid construct engineered for increasing the expression ot an 
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AR polypeptide [supra) in vivo 

Those skilled in the field of molecular biology will understand that any of a wide 
variety of expression systems may be used to prov.de the recombinant protein. The precise 
host cell used .s not critical to the invention The AR protein may be produced in a 
prokarvot.c host, for example. E colt, or in a eukaryotic host, for example, Saccharomyces 
cvrevisiac. mammalian cells , for example, COS 1 or NTH 3T3 cells., or any of a number of 
plant cells or whole plant including, without limitation, algae, tree spec.es. ornamental 
spec.es. temperate fruit spec.es. tropical fruit species, vegetable species, legume spec.es, 
cruder species, monocots, d.cots. or in any plant of commercial or agricultural significance. 
Particular examples of suitable plant hosts include, but are not limited to, conifers, petunia, 
tomato, potato, pepper, tobacco. Arabtdopsis. lettuce, sunflower, oilseed rape, flax, cotton, 
sugarbeet. celery, soybean, alfalfa. Medicago, lotus. Vigna. cucumber, carrot, eggplant, 
cauliflower, horseradish, morn.ng glory, poplar, walnut, apple, grape, asparagus, cassava, 
nee, maize, millet, onion, barley, orchard grass, oat, rye, and wheat. 

Such cells are available from a wide range of sources including the American Type 
Culture Collection (Rockland, MD); or from any of a number seed compan.es. for example, 
W Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. (Greenwood, SC), Johnny Seed 
Co (Alb.on. ME), or Northrup King Seeds (Harstvillc, SC). Descnpt.ons and sources of 
useful host cells are also found in Vasil Cell Culture and Somatic Cell Genetics of 
Plants, Vol 1. II. Hi Laboratory Procedures and Their Applications Academic Press. New 
York. 1984; Dixon, R.A., Plant Cell Culture- A Practical Approach, IRL Press. Oxford 
University, 1985; Green et al , Plant Tissue and Cell Culture, Academic Press, New York, 
1987; and Gasser and Fraiey, Science 244:1293, 1989 

Fur prokarvotic expression, DNA encoding an AR polypeptide is earned on a vector 
operably linked to control signals capable of effecting expression in the prokaryotic host. If 
desired, the coding sequence may contain, at its 5' end, a sequence encoding any of the 
known signal sequences capable of effecting secretion of the expressed protein into the 
penplasm.c space of the host cell, thereby facilitating recovery of the protein and subsequent 
purification Prokaryotes most frequen.lv used are various strains of E coir, however, other 
microbial strains may also be used. Plasm.d vectors are used which contain replication 
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on ems. selectable markers, and control sequences derived from a species compatible with the 
microbial host. Examples of such vectors are found in Pouwels et al. (supra) or Ausubel et 
a:, (supra) Commonly used prokaryotic control sequences (also referred to as "regulatory 
elements' 1 ) are defined herein to include promoters for transcription initiation, optionally with 
5 an operator, along with ribosomc binding site sequences Promoters commonly used to direct 
protein expression include the beta-lactamase (penicillinase), the lactose ilac) (Chang et al.. 
Xature 198:1056. 19~7), the tryptophan (Trp) (Goeddel et al., Xuc! Acids Res 8:4057, 
1980). and the tac promoter systems, as well as the lambda-derived P, promoter and N-genc 
nbosome binding site (Simatake et al., Xature 292: 12S. 1981 ). 

10 One particular bacterial expression system tor AR polypeptide production is the /:. 

coli pET expression system (Novagen. Inc.. Madison. WI) According to this expression 
system. DNA encoding an AR polypeptide is inserted into a pET vector in an orientation 
designed to allow expression. Since the AR gene is under the control of the T7 regulatory 
signals, expression of AR is induced by inducing the expression of T7 RNA polymerase in 

1 5 the host cell. This is typically achieved using host strains uhich express T 7 RNA polymerase 
in response to IPTG induction. Once produced, recombinant .AR polypeptide is then isolated 
according to standard methods known in the art, for example, those described herein. 

Another bacterial expression system for AR polypeptide production is the pGEX 
expression system (Pharmacia). This system employs a GST gene fusion system which is 

20 designed for high-level expression of genes or gene fragments as fusion proteins with rapid 
purification and recovery of functional gene products. The protein of interest is fused to the 
carboxvl terminus of the glutathione S-transferase protein from Schistosoma japonicum and 
is readiiv purified from bacterial lysatcs by affinity chromatography using Glutathione 
Sepharose 4B. Fusion proteins can be recovered under mild conditions by elution with 

25 glutathione. Cleavage of the glutathione S-transferase domain from the fusion protein is 
facilitated by the presence of recognition sites for site-specific proteases upstream of this 
domain. For example, proteins expressed m pGEX-2T plasmids may be cleaved with 
thrombin; those expressed in pGEX-3X may be cleaved with factor Xa 

For eukaryotic expression, the method of transformation or transfection and the 

30 choice of vehicle for expression of the .AR polypeptide will depend on the host system 



WO 98/06748 I-CT/US97/1 J9W 



-49- 

selected. Transformation and iransfecuon methods are desenbed, e.g . m Ausubel et al. 
isupra); Weissbach and Weissbach, Methods for Plant Molecular Biotogw Academic Press, 
1989; Gclvin et al.. Plant Molecular Biology- Manual, kiuwer Academic Publishers, 1990; 
Kindle, k.. Proc. Sail Acad Sci . ISA S^:122S, 1990, Potrykus, 1 . Annu. Rev. Plant 
5 Physiol. Plant Mol Biology 42:205. 1991; and BioRad (Hercules. CA) Technical Bulletin 
-1687 (Biohstic Particle Delivery Systems). Expression vehicles may be chosen from those 
provided, e.q., tn Cloning Vectors: A Laboratory* Manual (P.H. Pouwels et al.. 1985, Supp. 
19S7); Gasscr and Fraley {supra): Clontech Molecular Biology Catalog (Catalog 1992/93 
Tools for the Molecular Biologist. Palo Alto, CA), and the references cited above. Other 

10 expression constructs are desenbed by Fraley et a!. (U S Pat No. 5.352,605). 
Construction of Plant Transgenes 

Most preferably, an AR polypeptide is produced by a stably-transfected plant cell line, 
a transiently-transfected plant cell line, or by a transgenic plant. A number of vectors suitable 
for stable or extrachromosomal transfection of plant cells or for the establishment of 

1 5 transgenic plants are available to the public; such vectors are described in Pouwels et al. 
{supra), Weissbach and Weissbach (supra), and Gelvin et al. (supra). Methods for 
constructing such cell lines are desenbed in, e.g., Weissbach and Weissbach (supra), and 
Gelvin et al. (supra). 

Typically, plant expression vectors include ( 1 ) a cloned plant gene under the 

20 transcriptional control of 5' and 3' regulatory sequences and (2) a dominant selectable marker. 
Such plant expression vectors may also contain, it desired, a promoter legulatory region (for 
example, one conferring inducible or constitutive, pathogen- or wound-induced, 
environmentally- or dcvelopmentally-regulated, or ceil- or tissue-specific expression), a 
transcription initiation start site, a nbosome binding sue, an RNA processing signal, a 

25 transcription termination site, and/or a polyadenyiation signal. 

Once the desired AR nucleic acid sequence is obtained as desenbed above, it may be 
manipulated in a variety of ways known in the art. For example, where the sequence involves 
non-coding flanking regions, the flanking regions may be subjected to mutagenesis. 

The AR DNA sequence of the invention may, if desired, be combined with other 

3<i DNA sequences in a variety of ways The AR DNA sequence of the invention may be 
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employed with a!: or pan of the gene sequences normally associated with the AR protein In 
its component parts, a DNA sequence encoding an .AR protein is combined in a DNA 
construct having a transcription initiation control region capable of promoting transcnption 
and translation ir. a host cell 
5 In general, the constructs will involve regulatory regions functional in plants which 

provide for modified production of AR protein as discussed herein. The open reading frame 
codme for the AR protein or functional fragment thereof will be joined at its 5' end to a 
transcription initiation regulatory region such as the sequence naturally found in the 5* 
upstream region of the AR structural gene. Numerous other transcription initiation regions 

1 0 arc available which provide for constitutive or inducible regulation 

For applications where developmental, cell, tissue, hormonal, or environmental 
expression is desired, appropriate 5' upstream non-coding regions are obtained from other 
genes, for example, from genes regulated during menstem development, seed development, 
embryo development, or leaf development. 

1 5 Regulatory' transcript termination regions may also be provided in DNA constructs of 

this invention as well. Transcript termination regions may be provided by the DNA sequence 
encoding the AR protein or any convenient transcnption termination region derived from a 
different gene source. The transcript termination region will contain preferably at least 1-3 
kb of sequence 3' to the structural gene from which the termination region is derived. Plant 

20 expression constructs having AR as the DNA sequence of interest for expression (in either 
the sense or antisense orientation) may be employed with a wide variety of plant life, 
particularly plant life involved in the production of storage reserves (for example, those 
involving carbon and nitrogen metabolism). Such genetically engineered plants are useful 
for a vanetv of industrial and agricultural applications as discussed infra Importantly, this 

25 invention is applicable to dicotyledons and monocotyledons, and will be readily applicable to 
any new or improved transformation or regeneration method. 

The expression constructs include at least one promoter opcrably linked to at least one 
AR gene. An example of a useful plant promoter according to the invention is a cauhmovirus 
promoter, for example, a cauliflower mosaic virus (CaMV ) promoter. These promoters 

30 confer high levels of expression in most plant tissues, and the activity of these promoters is 
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not dependent on viraliy encoded proteins. CaMV is a source for both the 35S and 19S 
promoters Examples of plant expression constructs using these promoters are found in 
Fraley et al., U S. Pat. No. 5.352.605. In most tissues of transgenic plants, the CaMV 35S 
promoter is a strong promoter ( see, e.g.. Odell et al., Nature 313.810. 1985). The CaMV 
promoter is also highly active in monocots (see, e.g.. Dekevscr et al.. Plant Cell 2:591, 1990; 
Terada and Shimamoto, Moi Gen. Gene!. 220:389, 1990). Moreover, activity of this 
promoter can be further increased (i.e., between 2-10 fold) by duplication of the CaMV 35S 
promoter (see e.g.. Kay et al.. Science 236:1299, 1987; Ow ct al., Proc. Natl. Acad. Sci., 
U.S.A. 84:4870. 1987, and Fang et al.. Plant Cell 1:141, 1989, and McPherson and Kay, U.S. 
Pat. No. 5,378,142). 

Other useful plant promoters include, without limitation, the nopaline synthase (NOS) 
promoter (An et al.. Plant Physiol. 88:547, 1988 and Rodgers and Fraley, U.S. Pat. No. 
5,034,322), the octopine synthase promoter (Fromm et al.. Plant Cell 1 :977, 1989), figwort 
mosiac virus (FMV) promoter {Rodgers, U.S. Pat. No. 5,378,619), and the rice actin 
promoter (Wu and McElroy, W09 1/09948). 

Exemplary monocot promoters include, without limitation, commelina yellow mottle 
virus promoter, sugar cane badna virus promoter, nee tungro bacilliform virus promoter, 
maize streak virus element, and wheat dwarf virus promoter. 

For certain applications, it may be desirable to produce the AR gene product in an 
appropriate tissue, at an appropriate level, or at an appropriate developmental time. For this 
purpose, there are an assortment of gene promoters, each with its own distinct characteristics 
embodied in its regulatory sequences, shown to be regulated in response to inducible signals 
such as the environment, hormones, and/or developmental cues These include, without 
limitation, ^ene promoters that are responsible for heat-reguiated gene expression (see, e.g., 
Calhs et al.. Plant Physiol. 88:965, 1988; Takahashi and Komeda, Moi Gen. Genet. 219:365. 
1989, and Takahashi et al. Plant J. 2:751, 1992), light-regulated gene expression (e.g., the 
pea rhcSSA described by Kuhlemeier et al., Plant Cell 1 :47 1 , 1 989; the maize rbcS promoter 
described by Schiiffner and Sheen, Plant Cell 3:997, 1991 , the chlorophyll a/b-binding 
protein gene found in pea described by Simpson et al.. FMB( ) J 4:2723. I 985; the Arabssu 
promoter: or the rice rbs promoter), hormone-regulated gene expression (for example, the 
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abscisic acid (ABA) response e sequences from the Em gene of wheat described by Marcotte 
et a!., Plant Cell 1 :969. 1989; the AB A-inducible HVA1 and HYA22, and rd29A promoters 
described for barley and Arabidopsis by Straub et a!.. Plant Cell 6:61 7, 1994 and Shen et al.. 
Plant Cell 7:295, 1995; and wound-induced gene expression (for example, of wunl described 
5 by Siebertz et al.. Plant Cell 1 :961, 1989). organ-specific gene expression {for example, of 
the tuber-specific storage protein gene described by Roshal et al.. EMBO J 6:1 155. 1987, the 
23-kDa zein gene from maize described by Schernthaner et al., EMBO J. 7: 1 249, 1 988; or the 
French bean 6-phaseolm gene described by Bustos et al., Plant Cell 1 :839, 1989), or 
pathogen-inducible promoters ( for example, PR- 1 , prp- 1 , or P- 1 .3 glucanase promoters, the 

10 fungal-inducible wirla promoter of wheat, and the nematode-inducible promoters. TobRB 7 - 
5 A and Hmg-1, of tobacco and parsley, respectively) 

Plant expression vectors may also optionally include RNA processing signals, e.g, 
introns, which have been shown to be important for efficient RNA synthesis and 
accumulation (Callis et al.. Genes and Dev. 1:1 1 S3. 1987), The location of the RNA splice 

1 5 sequences can dramatically influence the level of transgene expression in plants- in view of 
this fact, an intron may be positioned upstream or downstream of an AR polypcptide- 
encoding sequence in the transgene to modulate levels of gene expression. 

In addition to the aforementioned 5 ? regulatory control sequences, the expression 
vectors may also include regulatory control regions which are generally present in the 3' 

20 regions of plant genes (Thornburg et al., Proc Natl. Acad. Sci. U.S.A. 84:744, 1987; An et 
al.. Plant Cell 1:1 15, 1989). For example, the 3' terminator region may be included in the 
expression vector to increase stability of the mRNA. One such terminator region may be 
derived from the PI-II terminator region of potato. In addition, other commonly used 
terminators are derived from the octopine or nopahne svnthase signals. 

25 The plant expression vector also typically contains a dominant selectable marker gene 

used to identify those cells that have become transformed Useful selectable genes for plant 
systems include genes encoding antibiotic resistance genes, for example, those encoding 
resistance to hygromycin, kanamycin, bleomycin, G41 8, streptomycin, or spectinomycin. 
Genes required for photosynthesis may also be used as selectable markers in photosynthetic- 

30 deficient strains. Finally, genes encoding herbicide resistance may be used as selectable 
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markers; useful herbicide resistance genes include the bar gene encoding the enzyme 
phosphmothncin acetyltransferase and conferring resistance to the broad spectrum herbicide 
Basta'S 1 (Hoechst AG. Frankfurt, Germany). 

Efficient use of selectable markers is facilitated by a determination of the 
5 susceptibility of a plant cell to a particular selectable agent and a determination of the 

concentration of this agent which effectively kills most, if not all, of the transformed cells. 
Some useful concentrations of antibiotics for tobacco transformation include, e.g., 75-100 
Hg mL (kanamycin), 20-50 ug/mL (hygromycin), or 5-10 ug/mL (bleomycin). A useful 
strategy for selection of transfonnants for herbicide resistance is described, e.g., by Vasil et 
! n a!., supra 

In addition, if desired, the plant expression construct may contain a modified or fully- 
synthetic structural AR coding sequence which has been changed to enhance the performance 
of the ^ene in plants. Methods for constructing such a modified or synthetic gene are 
described in Fischoff and Perlak, U.S. Pat. No 5.500,365. 

1 5 It should be readily apparent to one skilled in the art of molecular biology, especially 

in the field of plant molecular biology, that the level of gene expression is dependent, not 
only on the combination of promoters, RXA processing signals, and terminator elements, but 
also on how these elements are used to increase the levels of selectable marker gene 
expression. 

20 P lant Transformation 

Upon construction of the plant expression vector, several standard methods are 
available for introduction of the vector into a plant host, thereby generating a transgenic 
plant. These methods include (1 ) Agrobacterium-mcdiated transformation (A. tumefaciens or 
.■i. rhizozencs) (see, e.g., Lichtensfein and Fuller In: Cenetic Engineering, vol 6. PWJ Rigby, 

25 ed, London, Academic Press, 1987; and Lichtenstein. CP., and Draper, J,. In: DNA Clonings 
Vol II, D M. Glover, ed, Oxford, IRI Press, 1985)), (2) the particle delivery system (see, e.g., 
Gordon-Kamm et al.. Plant Cell 2:603 (1990); or BioRad Technical Bulletin 1687, supra), 
(3) microiniection protocols (see, e.g.. Green et al.. supra), (4) polyethylene glycol (PEG) 
procedures (see. e g , Draper et al.. Plant Cell Phvsiol 23*451 . 1982; or e.g . Zhang and Wu, 

3<» Theor Appl Cenei 76:835, 1988), (5) liposome-mediated DNA uptake (see, e.g.. Freeman et 
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ai.. Plant Cell Physiol. 25:1353. 19S4), (0) electroporation protocols i see, e g.. Geivm et al.. 
supra: Dekeyser et al., supra: Fromm et aL Suture 319:7^1, 198b, Sheen Plant Cell 2:102~\ 
1990; or Jang and Sheen Plant Cell 6:16to. 1994), and (7) the voncxing method (see, e.g.. 
Kindle supra ). The method of transformation is not critical to the invention. Any method 
which provides for efficient transformation may be employed. As newer methods are 
available to transform crops or other host cells, they may be directly applied Suitable plants 
for use in the practice of the invention include, but are not limited to, sugar cane, wheal, rice, 
maize, sugar beet, potato, barley, manioc, sweet potato, soybean, sorghum, cassava, banana, 
erape, oats, tomato, millet, coconut, orange, rye, cabbage, apple, watermelon, canola, cotton, 
carrot, garlic, omen, pepper, strawberry, yam, peanut, onion, bean, pea, mango, citrus plants, 
walnuts, and sunflower. 

The following is an example outlining one particular technique, an Agrobactenum- 
mediated plant transformation. By this technique, the general process for manipulating genes 
to be transferred into the genome of plant cells is carried out in two phases. First, cloning and 
DNA modification steps are earned out in E. colt, and the plasmid containing the gene 
construct of interest is transferred by conjugation or electroporation into Agrobacterium. 
Second, the resulting Agrobacterium strain is used to transform plant cells. Thus, for the 
generalized plant expression vector, the plasmid contains an origin of replication that allows 
it to replicate in Agrobacterium and a high copy number origin of replication functional in E 
coir This permits facile production and testing of transgenes in E. colt prior to transfer to 
Agrobacterium for subsequent introduction into plants. Resistance genes can be carried on 
the vector, one for selection in bacteria, for example, streptomycin, and another that will 
function in plants, for example, a gene encoding kanamycin resistance or herbicide resistance 
Also present on the vector are restriction endonuclease sites for the addition of one or more 
transgenes and directional T-DNA border sequences which, when recognized by the transfer 
functions of Agrobacterium, delimit the DNA region that will be transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell tungsten 
microprojectilcs on which cloned DNA is precipitated. In the Biolistic Apparatus (Bio-Rad) 
used for the shooting, a gunpowder charge (22 caliber Power Piston Tool Charge) or an air- 
dnven blast drives a plastic macroprojectile through a gun barrel. An aliquot of a suspension 
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ot tunesten particles on which DNA has been precipitated is placed on the front of the plastic 
macroprojectile. The latter is fired at an acrylic stopping plate that has a hole through it that 
is too smali for the macroprojectile to pass through. As a result, the plastic macroprojectile 
smashes against the stopping plate, and the tungsten microprojectiles continue toward their 
target through the hole m the plate. For the instant invention the target can be any plant ceil, 
tissue, seed, or embryo. The DNA introduced into the cell on the microprojectiles becomes 
integrated into either the nucleus or the chioroplast. 

In senerak transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene expression 
studies in plants and to produce improved plant varieties of agricultural or commercial 
interest. 

Transgenic Plant Regeneration 

Plant cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant tissue 
culture techniques. It is well known in the art that various cells, tissues, and organs from 
almost any plant can be successfully cultured to regenerate an entire plant; such techniques 
are described, e.g., in Vasil supra\ Green et al ., supra; Weissbach and Weissbach, supra\ and 
Gelvin et al., supra 

In one particular example, a cloned AR polypeptide construct under the control of the 
35S CaMV promoter and the nopahne synthase terminator and carrying a selectable marker 
(for example, kanamycin resistance) is transformed into Agrobacterium. Transtormation ot 
leaf discs (for example, of tobacco or potato leaf discs), with vector-containing 
Agrobacterium is earned out as described by Horsch et al. (Science 227:1229, 1985) 
Putative Transforrnants are selected after a few weeks (for example. 3 to 5 weeks) on plant 
tissue culture media containing kanamycin (e.g. I Ou ng/mL). Kanamycin-resistant shoots arc 
then placed on plant tissue culture media without hormones for root initiation. Kanamycin- 
resistant plants are then selected for greenhouse growth. If desired, seeds from self-fertilized 
transgenic plants can then be sowed in a soil-less medium and grown in a greenhouse. 
Kanamvcin-reMstant progeny arc selected by sowing surfaced sterilized seeds on hormone- 
tree kanamvcin-containing media. Analysis for the integration of the transgene is 
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accomplished by standard techniques isee. fc: example, Ausubcl et al. supra, Gelvin et a! 
supra). 

Transgenic plants expressing the selectable marker are then screened for transmission 
of the transgene DNA by standard immunoblot and DNA detection techniques. Each positiv e 
5 transgenic plant and its transgenic progeny are unique in comparison to other transgenic 

plants established with the same transgene Integration of the transgene DNA into the plant 
genomic DNA is m most cases random, and die site of integration can profoundly affect the 
levels and the tissue and developmental patterns of transgene expression. Consequently, a 
number of transgenic lines are usually screened for each transgene to identity and select 

10 plants with the most appropriate expression profiles. 

Transgenic lines are evaluated for levels of transgene expression. Expression at the 
RNA level is determined initially to identify and quantitate expression-positive plants. 
Standard techniques for RNA analysis are employed and include PCR amplification assays 
us:ng oligonucleotide primers designed to amplify only transgene RNA templates and 

1 5 solution hybridization assays using transgene-specific probes ( see, e.g., Ausubel et ah, 
supra). The RNA-positive plants are then analyzed for protein expression by Western 
immunoblot analysis using AR specific antibodies (see, e.g., Ausubcl et al., supra). In 
addition, /// situ hybridization and immunocytochemistry according to standard protocols can 
be done using transgene-specific nucleotide probes and antibodies, respectively, to localize 

20 sues of expression within transgenic tissue. 

Ectopic expression of AR genes is useful for the production of transgenic plants 
having an increased level of resistance to disease-causing pathogens. 

In addition, if desired, once the recombinant AR protein is expressed m any cell or in 
a transgenic plant (for example, as described above), it may be isolated, e.g., using affinity 

25 chromatography. In one example, an anti-AR polypeptide antibody (e.g., produced as 

desenbed in Ausubel et al., supra, or by any standard technique) may be attached to a column 
and used to isolate the polypeptide. Lysis and fractionation of AR-producing cells prior to 
affmitv chromatography may be performed by standard methods (see, e.g.. Ausubel et al., 
supra). Once isolated, the recombinant protein can, if desired, be further purified, for 

30 example, by high performance liquid chromatography (see. e.g.. Fisher. Laboratory 
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Techniques In Biochemistry And Molecular Bioiogw eds.. Work and Burden, Elsevier, 1980). 

Polypeptides of the invention, particularly shori AR protein fragments, can also be 
produced bv chemical synthesis (e.g.. by the methods descr.bed in Solid Phase Peptide 
Sxnthcsis, 2nd ed., K>S4 The Pierce Chemical Co., Rockford. IL) These general techniques 
of polypeptide expression and purification can also be used to produce and isolate useful AR 
fragments or analogs. 

Ectopic Expression of AR Genes for Engineering Plant Defense Responses to Pathogens 

As discussed above, plasmid constructs designed for the expression of AR gene 
products are useful, for example, for activating plant defense pathways that confer anti- 
pathogenic properties to a transgenic plant. AR genes that arc isolated from a host plant (e.g.. 
Arabidopsis or Nicotiana) may be engineered for expression in the same plant, a closely 
related species, or a distantly related plant species. For example, the cruciferous Arabidopsis 
SPR1 gene may be engineered for constitutive low level expression and then transformed 
into an Arabidopsis host plant. Alternatively, the Arabidopsis NPR1 gene may be engineered 
for expression in other cruciferous plants, such as the Brassicas ( for example, broccoli, 
cabbage, and cauliflower). Similarly, the NPR1 homolog of Nicotiana glutinosa is useful for 
expression in related solanaceous plants, such as tomato, potato, and pepper. To achieve 
pathogen resistance, it is important to express an .AR protein at an effective level. Evaluation 
of the level of pathogen protection conferred to a plant bv ectopic expression of an AR gene 
is determined according to conventional methods and assays. 

In one working example, constitutive ectopic expression ot the NPR1 gene of 
Arabidopsis (Fig. 5, SEQ ID NO:2) or the NPR I homolog of Nicotiana glutinosa 
(Fig. 7A; SEQ ID NO: 13) in Russet Burbank potato is used to control Phytophthora infestans 
infection. In one particular example, a plant expression vector is constructed that contains an 
NPR1 cDNA sequence expressed under the control of the enhanced CaMV 35S promoter as 
described by McPherson and Kay (U.S. Patent 5,359,142). This expression vector is then 
used to transform Russet Burbank according to the methods described in Fischhoff et al. 
(U.S. Patent 5,500,365). To assess resistance to fungal infection, transformed Russet 
Burbank and appropriate controls are crown to approximately eiuht-wceks-old, and leaves 
(for example, the second or third from the top of the plant) are inoculated with a mycelial 
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suspension of P infestans Plugs of P infcstans myceha arc inoculated on each side of the 
leaf midvein Plants are subsequently incubated in a growth chamber at 2" C with constant 
fluorescent light 

Leaves of transformed Russet Burbank and control plants are then evaluated for 
resistance to P. infcstans infection according to conventional experimental methods For this 
evaluation, the number of lesions per leaf and percentage of lea: area infected are recorded 
cverv f twenty-four hours for seven days after inoculation. From these data, levels of 
resistance to P infestans are determined. Transformed potato plants that express an NPRl 
eenc having an increased level of resistance to P infcstans relative to control plants are taken 
as being useful in the invention. 

Alternatively, to assess resistance at the whole plant level, transformed and control 
plants are transplanted to potting soil containing an inoculum of P. infestans. Plants are then 
evaluated for symptoms of fungal infection < for example, wilting or decayed leaves) over a 
period of time lasting from several days to weeks. Again, transformed potato plants 
expressing the NPR1 gene having an increased level of resistance to the fungal pathogen, P 
infestans, relative to control plants are taken as being useful in the invention. 

In another working example, expression of the NPRl homolog of Nicotiana glutinosa 
in tomato is used to control bacterial infection, for example, to Pseudomonas svringae. 
Specifically, a plant expression vector is constructed that contains the cDNA sequence of the 
NPRl homolog from Nicotiana glutinosa (Fig. 7 A; SEQ ID NO:l 3) which is expressed 
under the control of the enhanced CaMV 35S promoter as described by McPherson and Kay, 
supra. This expression vector is then used to transform tomato plants according to the 
methods described in Fischhoff ct ah, supra. To assess resistance to bacterial infection, 
transformed tomato plants and appropriate controls are grown, and their leaves are inoculated 
with a suspension of P. svringae according to standard methods, for example, those described 
herein. Plants are subsequently incubated in a growth chamber, and the inoculated leaves are 
subsequently analyzed for signs of disease resistance according to standard methods. For 
example, the number of chlorotic lesions per leaf and percentage of leaf area infected are 
recorded and evaluated after inoculation From a statistical analysis of these data, levels of 
resistance to P. svringae are determined. Transformed tomato plants that express an NPRl 
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homolot: of Sicotiana gluanosa gene hav ing an increased level of resistance to P. svringae 
relative to control plants are taken as being useful in the invention. 

In still another working example, expression of the SPR1 homolog of rice is used to 
control funeal diseases, for example, the infection of tissue by Mugnaponhc grisea* the cause 
5 of rice blast. In one particular approach, a plant expression v ector is constructed that contains 
the cDNA sequence of the rice SPR1 homolog that is constitutively expressed under the 
control of the rice actin promoter described by Wu el al. (WO 91 09948) This expression 
vector is then used to transform rice plants according to conventional methods, for example, 
using the methods described in Hiei et al. (Plant Journal 6:271-282, 1994). To assess 

I o resistance to fungal infection, transformed rice plants and appropriate controls are grown, and 
their leaves are inoculated with a mycelial suspension of A/ grisea according to standard 
methods. Plants are subsequently incubated in a growth chamber and the inoculated leaves 
are subsequently analyzed for disease resistance according to standard methods. For 
example, the numbei of lesions per leaf and percentage of leaf area infected are recorded and 

1 5 evaluated after inoculation. From a statistical analysis of these data, levels of resistance to M 
grisea are determined. Transformed nee plants that express a nee NPR1 homolog having an 
increased level of resistance to M. grisea relative to control plants are taken as being useful in 
the invention. 

20 AR Interacting Polypeptides 

The isolation of AR sequences aiso facilitates the identification of polypeptides which 
interact with the AR protein. Such polypeptide-cncoding sequences are isolated by any 
standard two hybrid system (see, for example, Fields et a!., Nature 340:245-246, 1989; Yang 
et bioscience 257:680-682, i 992; Zcrvos et al., Cell "2:223-232, 1993). For example, all or 

25 a part of the AR sequence may be fused to a DNA binding domain (such as the GAL4 or 
Lex A DNA binding domain). After establishing that this fusion protein does not itself 
activate expression of a reporter gene (for example, a lacZ or LEU2 reporter gene) bearing 
appropriate DNA binding sites, this fusion protein is used as an interaction target. Candidate 
interacting proteins fused to an activation domain ( for example, an acidic activation domain) 

30 are then co-cxpressed with the AR fusion in host ceils, and interacting proteins are identified 
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bv their ability to contact the AR sequence and stimulate reporter gene expression. AR- 
interactmg proteins identified using this screening method provide good candidates for 
proteins that arc involved in the acquired resistance signal transduction pathway. 
Antibodies 

AR polypeptides described herein (or lmunogemc fragments or analogs) may be used 
to raise antibodies useful in the invention; such polypeptides may be produced by 
recombinant or peptide synthetic techniques (see, e.g.. Solid Phase Peptide Synthesis. 2nd 
ed , 1 984, Pierce Chemical Co., Rockford, IL, Ausubel et al., supra). The peptides may be 
coupled to a carrier protein, such as KLH as described in Ausubel et al. supra. The KLH- 
peptide is mixed with Freund's adjuvant and injected into guinea pigs, rats, or preferably 
rabbits. Antibodies may be purified by peptide antigen affinity chromatography. 

Monoclonal antibodies may be prepared using the AR polypeptides desenbed above 
and standard hybridoma technology (see, e.g., Kohlcr et al.. Suture 256:495. 1975; Kohler et 
al., Eur. J. Immunol. 6:51 1, 19^6; Kohler et al., Eur. J Immunol 6:292, 1976; Hammerling et 
al.. In Monoclonal Antibodies and T Cell Hvbndomas, Elsevier, NY, 1 98 I ; Ausubel et al., 
supra) 

Once produced, polyclonal or monoclonal antibodies are tested for specific AR 
recognition by Western blot or immunoprecipitation analysis (by the methods desenbed in 
Ausubel et al., supra). Antibodies which specifically recognize AR polypeptides are 
considered to be useful in the invention; such antibodies may be used, e.g.. in an 
immunoassay to monitor the level of AR polypeptide produced by a plant. 
Use 

The invention desenbed herein is useful lor a variety of agricultural and commercial 
purposes including, but not limited to. improving acquired resistance against plant pathogens, 
increasing crop yields, improving crop and ornamental quality, and reducing agricultural 
production costs. In particular, ectopic expression of an AR gene in a plant cell provides 
acquired resistance to plant pathogens and can be used to protect plants from pathogen 
infestation that reduces plant productivity and viabilit\ 

The invention also provides for broad-spectrum pathogen resistance by facilitating the 
natural mechanism of host resistance. For example, AR iransgenes can be expressed in plant 
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cells at sufficiently high levels to initiate an acquired resistance plant defense response 
constitutively in the absence of signals from the pathogen. The level of expression associated 
with such a plant defense response may be determined by measuring the levels of defense 
response eene expression as described herein or according to any conventional method. If 
desired, the AR transgenes are expressed by a controllable promoter such as a tissue-specific 
promoter, cell-type specific promoter, or by a promoter that is induced by an external signal 
or a^ent such as a pathogen- or wound-inducible control element, thus limiting the temporal 
or tissue expression or both of an acquired resistance defense response. The AR genes may 
also be expressed in roots, leaves, or fruits, or at a site of a plant that is susceptible to 
pathogen penetration and infection. 

The invention is aiso useful for controlling plant disease by enhancing a plant's SAJ< 
defense mechanisms. In particular, the invention is useful for combating diseases known to 
be inhibited by plant SAR defense mechanisms. These include, without limitation, viral 
diseases caused by TMV and TNV, bacterial diseases caused by Pscudomonas and 
Xanthomonas, and fungal diseases caused by Erysiphe* Peronospora. Phytophthora, 
Colletotnchum, and Magnaporthe gnsea. In particular exemplary approaches, constitutive or 
inducible expression of an AR gene in a transgenic plant is useful for controlling powdery 
mildew of wheat caused by Erysiphe* bacterial leaf spot of pepper caused by Xanthomonas 
campestris* bacterial wilt and bacterial spot of tomato caused by Pseudomonas synngae and 
Xanthomonas tampestris. and bacteria! blights of citrus and walnut caused by Xanthomonas 

cam pes tr is . 

Other Embodiments 
The invention further includes analogs of any naturalk -occurring plant AR 
polypeptide. Analogs can differ from the naturally-occurring AR protein by amino acid 
sequence differences, by post-translational modifications, or by both. Analogs of the 
invention will generally exhibit at least 40%. more preferably 50%. and most preferably 60° o 
or even having 70%, 80%, or 90% identity with all or part of a naturally-occurring plant AR 
ammo acid sequence The length of sequence comparison is at least 15 amino acid residues, 
preferablv at least 25 ammo acid residues, and more preferably more than 35 amino acid 
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rcsidues. Modifications include in vi\r> and in vitro chemical den vatization of polypeptides, 
e.g . acetvlation. carhoxylation, phosphorylation, or gkcosylation; such modifications may 
occur dunng polypeptide synthesis or processing or following treatment with isolated 
modifying enzymes. Analogs can also differ from the naturally-occurring AR polypeptide by 
alterations in primary sequence. These include genetic variants, both natural and induced (for 
example, resulting from random mutagenesis by irradiation or exposure to ethyl 
methvisulfate or by site-specific mutagenesis as described in Sambrook, Fritsch and Mamatis. 
Molecular Cloning A Laboratory Manual (2d ed j, CSH Press. 1 089, or Ausubei et ah. 
supra). Also included are cych/ed peptides, molecules, and analogs which contain residues 
other than L-ammo acids, e.g.. D-ammo acids or non-naturally occurring or synthetic amino 
acids, e.g., P or y amino acids. 

In addition to full-length polypeptides, the invention also includes AR polypeptide 
fragments. As used herein, the term "fragment/' means at least 20 contiguous amino acids, 
preferably at least 30 contiguous amino acids, more preferably at least 50 contiguous ammo 
acids, and most preferably at least 60 to 80 or more contiguous amino acids. Fragments of 
AR polypeptides can be generated by methods known to those skilled in the an or may result 
from normal protein processing (e.g., removal of ammo acids from the nascent polypeptide 
that are not required for biological activity or removal of amino acids by alternative mRNA 
splicing or alternative protein processing events). In preferred embodiments, an AR 
polypeptide fragment includes an ankynn-repeat motif as described herein. In other preferred 
embodiments, an AR fragment is capable of interacting with a second polypeptide component 
of the AR signal transduction cascade. 

Furthermore, the invention includes nucleotide sequences that facilitate specific 
detection of an AR nucleic acid Thus, AR sequences described herein or portions thereof 
may be used as probes to hybridize to nucleotide sequences from other plants (e.g., dicots. 
monocots, gymnosperms, and algae) by standard hybridization techniques under conventional 
conditions. Sequences that hybridize to an AR coding sequence or its complement and that 
encode an AR polypeptide are considered useful in the invention. As used herein, the term 
"fragment," as applied to nucleic acid sequences, means at least 5 contiguous nucleotides, 
preferably at least 10 contiguous nucleotides, more preferably at least 20 to 30 contiguous 
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nucleotides. and most preferably at least 40 to SO or more contiguous nucleotides. Fragment 
of AR nucleic acid sequences can be generated by methods known to those skilled in the art. 



Deposit 

Cosmids 2 I A4-2- 1,21 A4-4-3- 1 , 2 1 A4-P5- 1 have been deposited with the American 
Type Culture Collection on July 8, 1 996, and bear the accession numbers ATCC No. 97649, 
97650, and 9765 1 Plasmid pKExNTR 1 was deposited on July 3 1 . 1 996 and bears the 
accession number ATCC No 97671 Applicants acknowledge their responsibility to replace 
these plasmids should it loose viability before the end of the term of a patent issued hereon, 
and their responsibility to notify the American Type Culture Collection of the issuance of 
such a patent, at which time the deposit will be made available to the public. Prior to that 
time the deposit will be made available to the Commissioner of Patents under terms of 37 
CFR $ 1 .14 and 35 USC $ 112. These deposits are av ailable as required by foreign patent 
laws in countries wherein counterparts of this subject application, or progeny, are filed. It 
should be understood that availability of a deposit does not constitute a license to practice the 
subject invention 

Aii publications and patent applications mentioned in this specification are herein 
incorporated by reference to trie same extent as if each independent publication or paient 
application was specifically and individually indicated to be incorporated by reference 
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SEQUENCE LISTING 

( 1 ) GENERAL INFORMATION 

(i) APPLICANT: The General Hospital Corporation et ai. 

(ii) TITLE OF THE INVENTION: 

ACQUIRED RESISTANCE GENES AND USES THEREOF 

(in) NUMBER OF SEQUENCES: 28 

(iv ) CORRESPONDENCE ADDRESS. 

(A) .-ADDRESSEE: Clark & Elbing LLP 

(B) STREET: 176 Federal Street 

(C) CITY: Boston 

(D) STATE: MA 

(F.) COUNTRY: USA 
(F) ZIP: 021 10 



(v) COMPUTER READABLE FORM; 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT US97 — - 

(B) FILING DATE: 08-AUG-97 

(C) CLASSIFICATION: 

(vn) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60 023.85 1 

(B) FILING DATE: August 9. 1996 

(A) APPLICATION NUMBER: 60 035. 166 

(B) FILING DATE January 10. 1997 

(A) APPLICATION NUMBER: 60/046.769 

(B) FILING DATE: May 16. 1997 
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(vi:m ATTORNEY AGENT INFORMATION: 

(A) NAME: Elbing. Karen L 

(B) REGISTRATION NUMBER: 35,238 

(C) REFERENCE DOCKET NUMBER: 00786 '339W04 

(ix) TELECOMMUNICATION INFORMATION: 
{A) TELEPHONE: 617-428-0200 
(B) TELEFAX: 617-428-7045 



(2> INFORMATION FOR SEQ ID NOT: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(])) MOI ECU! F TYTF Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

A AGCTTGTG A TGCA AGTCAT GGGATATTGC TTTGTGTTA A GTAT AC A A A A CCATCACGTG 60 

GATACATAGT CTTCA AACCA ACCACIAAAC AG I A I CAGG T C ATACCA A AG CCAGAAGTGA i 20 

AGGGTTGGG A TATGTCATTG GGTTTAGCGG TAATCGG ATT GA ACCCTTTC CGG TATAAA A 1 80 

TACAA AGGCT TTCGCAGTCT CGGCGTATGT GTATGTCTCG GGGTATCTAC CATTTGAATC 240 

ACAGAACTTT TATGTGCG AA GTTTTCGATT CTGATTCGTT TACCTGGAAG AGATTAGAAA 300 

TTTGCGTCTA CCA A A A ACAG ACAGATTAAT TTTTTCCAAC CCG ATACAAG TTTCGGGGTT 360 

CTTGCATTGG ATATCACGG A ACA ACAATGT GATCCGGTTT TGTCTCA AA A CCGA AACTTG 420 

GTCCTTCTTC CATACTCCGA ACTCTOATOT TTTCTC AGG A TT AGTGAG AT ACGAAGGGAA 480 

GCTAGGTGCT ATTCGTCAGT GGACAAACAA AGATCAAGAA GATGTTCACG AGTTATGGG 7 540 

TTT \ A \ C AOC A C: TTTTG A A A A G TCG T GG O T 7" A A A G TG a a a hat A TT A a a a GfA TTG G A G7 600 

AGaTTTGaTT aCGTGGACTC CAAGGAACGjA CGTi GTaTTG TTTCGTAGTA GTGATCGTGG 660 

TTGCCTCTAC A ACATAAACG CAGAGAAGTT GAATTTAGTT TATGCAAAAA AAGAGGGATC 720 

TG A TTGTTGT TTCGTTTGTT TTCCG TTTTG TTCTGATTAC GAG AGGGTTG ATCTGAACGG 780 

A Af ■ A AGf" A AG GGGCCGACAC TTTAAAAAAA AAATAAAAAA AA TGGGCCGA CAAATGCAAA 840 

CG TaGTTGaC AAGGATCTCA AGTCTCAAGT CTCAATFGGC 7CGCTCA7TG TGGGGCATAA 900 

A r A TaTCTAG TGaTCTTTAA TTGTIT!T! A TAAGGTA AAA AGGA A TATTG A A TTTTG TTT 060 

CTTAGGTTTA TGTAATAATA CCAAACATTG TTTTATGAAT ATTTAATCTG ATTTTTTGGC !020 

TAGTTATTTT ATTATATCAA GGGTTCCTGT TTATAGTTGA A A ACAGTTAC TGTATAGAAA 1080 

ATAGTGTCCC AATTTTCTCT CTTA AATA AT ATATTAGTTA ATAAAAGATA TTTTAATATA I 140 

TT AG AT AT AC AATAATATCT A AAGCA ACAC ATATTTAG AC ACAACACGTA AT ATC7TAC7 1200 

A TTG TTT A CA TATATTTATA GCTTACCA AT ATAACCCGTA TCTATGTTTT ATAAGCTTTT 1 260 

AT A( AATATA TGTACGGTAT GCTGTCCACG TATATATATTCTCCAAAAAA AACGCATGGT 1320 

A LA C A A A A TT T ATT A A ATA T TTGGCAATTG GGTGTTTATC TA AAGTT7 AT CACA ATATTT 1380 

ATC VACTATA ATAG ATGGTA GAAGATAAAA AAATTATATC AG ATT G A TTG AA7TAAATTT 1440 

T A I A A T A T A T C A TTTT A A A A A ATTAATTAA AAGAAAACTA TTTCATAAAA TTGTTCAAAA 1 5U0 

OA ! AATTAGT AAAATTAATT A AATATGTGA TGCTATTGAA TTATAGAGAG TTATTGTAA A 1560 

f ! ; \L T I A A A A IL \l A C A A A TC7TATCCT A A TTT A A CTT A TC A TTT A A G A AATA C A A A A O ! 6 2 ' » 

T A A \ A A A CGG G G A A AGGAAT A A TTT A TTT A C CTTATTATA A( TCCTATAT AAAGTACTCT 1680 

G7TTA7TGAA GA I AATCTTA CGTTGTTGTA TTCA7 AGGCa TCTTTAACCT ATCTTTTCaT | 740 
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icgm"c;tta accaaatcca gttgataagg tctcttcgtt gattagc aga gatct tt; a mi 
a thgtg a at ttcaahcat cgga acctgt tg atg g ac acc acc att gat gg a ! . 3 

Met Asp I hr 1 hr He Asp Gi> 

; 5 

I TC GCC GAT TCT TAT G A A ATG AGG AGC ACT AGT TTC GTC GGT ACC GA'I G»i 
Pne AG Asp Scr 1 \r gIj !!e Ser Scr Th' Scr Phe \ al Ala Thr Asp 

;o 15 2'> 

A AC ACC GAC TCC TCT ATT GTT TAT CTG GCC GCC GA A C AA GTA CTC ACC 209 
Asn Lg Asp Ser Scr lie Val T>r Lea Ala Ala Giu Gin Ya! Leu Thr 

-< < \ r) 3 v 

uuuVI OATcTA TCT OCT «TG CAA TTG CTC TCC A A v ' A G C ITCCiAA ICC 25^ 
Gr. i-ro Asp \ ai Ser AG Leu Gin :.cu Lea Scr A^n Ser phe Gin Ser 
4(> 4.5 MJ 5 5 

G I ■ ■ T~T GAC TCC CCG C j A I GAT TTC TAG Awl G AC GO A AG C TT G IT CTC 3o5 
\ a! Phe Asp Ser Pro A -p Asp Phe Tvr Scr Asp Aia LnkiAal Leu 
C,u f-5 "0 

TCC GAC GGC CGG GA A GTT TCT TTC CAC CGG TGC GTT TTG T( ' A GCG AG A 351 
Ser Asp Giy Are Glu Val Ser Phe His Arg Cys \ at Leu Ser Ala Arc 
-5 SO 85 

AGC TCT TTC TTC A AG AGC GCT TT A GCC GCC GCT A AG A AG GAG AAA GAG 401 
Ser Ser Phe Phe Lys Ser Ala Leu Ala Aia Ala Lvs Lvs Glu Lys Asp 
9!) 95 DO 

TCC AAC AAC ACC GCC GCC GTG A AG CI C GAG (TT AAG GAG ATI G(T a A G 44<) 
Ser Asn Asn Thr A'a Ala Val Lvs Leu Glu Leu Lys Glu lie Aia Ly^ 
105 J 10 115 

GA F T AC G A A GTC GGT TTC GAT TCG GTT GTG ACT GTT TTG GCT TAT GTT 49^ 
Asp Tvr Glu Val Glv Phe Asp Ser Val Val Thr Val Leu Ala Tyr Val 

12o 125 13d 135 

TAG AGC AGC AG A GTG AG A CCG CCG CC1 AAA GGA GTT TCT G A A TCC GCA 54- 
I vr Ser Ser Arg Val Arg Pro Pro Pro Lys Glv Val Ser Giu Cys Ala 
',40 145 150 

GAC GAG A AT TGC TGC CAC GTG GCT TGC CGG CCG GCG GTG (SAT TTG ATG 593 
Asp G Asn Cys Cys Mis Val Ala Cys Arg Pro Ala Val Asp. Phe Met 
155 100 105 

"TTG GAG GTT CTC I AT TTG GCT TTC ATC TIC AAG ATC CC f C j A A TT A A : I 04 1 

Leu Glu Val Leu Tvr Leu Ala Phe He Phe Lvs lie Pro Giu Leu lie 
ro J "5 1 80 

ACT CTC TAT GAG AGG CAC TP A TTG GAC GTT GTA GAC AAA GTT GTT ATA 6*9 
Thr Leu Tvr Gin Arg His Leu Leu Asp Val Val Asp Lvs Val Val He 

\k\ i9!j »95 

GAG GAC ACA TTC; GTT ATA CTC AAG CTT GCT A A T ATA TGT GGT AAA GCT 
Glu Asp Thr Leu Val lie Leu Lys Leu Ala Asn lie Cvs Glv Lvs Ala 
200 205 210 215 
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roT a Tu a a.;- ( '"i -x :tu o \t aga re : a a a uao a ft ait gtc a ag tct a a^ 

r v b Mc: _ ^ Lcj Leu A-p A r L : t \ . C.'.u ' ! e f ! c V j I L Ser Asn 
220 22 ^ 23* 

GTA GAT ATG GTT ACT CTT G A A A AG PC A TTG CGG GAA GAG GTT GIT AAA 
Vu! A ? p Mc! \ al Ser Leu Glu Lvs Scr Leu Pro Glu Giu Leu \ a I Lvs 

245 

-AG ATA ATT GAT V , A GGT A AA GAG CTT GGT TTG GAG G FA CGI AAA GTA 
Glu Me He Asp Are Are l.\s ( }!u Leu Gl> Leu Glu Val Pro Lys V ji 

Z5;i 2< 5 260 

\ AG AAA CAT GTC TCG A AT GTA CAT A AG GCA CTT GAC TCG GAT GAT ATT 
Lys Lvs His Vui Ser Asn \ ai His Lvs Aia Leu Asp Ser Asp Asp lie 
'2fn 2T> 2^5 

GAG TLA GTC AAG TTG CTT TTG AAA G AG GAT CAC ACC A AT CTA GAT GAT 
Glu Leu Val L>s Leu Leu Leu Lvs Glu Asp His Thr Asn Leu Asp Asp 

2 Si! 2S5 290 2 r G> 

< ,Cv. TOT GC T CTT' CAT TLC OCT GTT GCA CAT TGt A A T GTG AAG ACC GCA 
Ala Cys Ala Leu His Phe Ala V al Ala Tvr C> s Asn Val Lys I hr Ala 
3 pn 305 310 

AC A GAT CTT TTA AAA CTT GAT CTT GCC GAT GTC AAC CAT AGG A A T CCG 
Thr A>p Leu Leu L>s Leu Asn Leu Ala Asp Val Asn I Its Are. Asn Pro 
3: 5 32o 32" 

AGG GG A TAT ACG GTG CTT CAT GTT GCT GCG ATG CGG AAG GAG CCA CAA 
Ar^ Glv Tvr Thr \'ai Leu His Val Ala Aia Mel Are Lvs Glu Pro Gir. 

3 30 3 3^ 340 

TTG ATA CTA TCT CTA TTG GAA AAA GGT GCA AGT GCA TCA GAA GCA ACT 
Leu lie Leu Ser Leu Leu Glu Lys Glv Ala Ser Ala Scr Glu Ala Thr 
345 350 355 

TTG GAA GGT AGA ACC GCA CTC ATG ATC GCA AAA CAA GCC ACT ATG GCG 
L.cu Glu Gly Are Thr Ala Leu Met lie Ala Lvs Gin Ala Thr Met Ala 

.H>i. 30^ 371; ' o 

GTT GAA TGT A AT A AT ATC CGG GAG GAA i GG A AO v. A i U i L U AAA GGC 
Val G<u Cys Asn Asn lie Pro Giu Gin Cys Lys His Ser Leu Lys Giv 
^Ru 3? 5 39 fl 

GG A CTA TCT GTA GAA AT A CTA GAG CAA G A A GAC AAA CO A GAA CAA ATT 
Are Leu Ovs Va! Glu lie Leu Glu Gin Q\u Asp Lys Arg Glu Gin He 
} 9 5 400 4U5- 

CCT AGA GAT GTT CCT CCC TCI' TIT GCA GTG GCG GC 0 GA T GAA TTG AAG 
Pro Arg Asp Val Pro Pro Ser Phe Ala Val Ala Ala Asp Glu t cu L>s 

4 0= 4! 5 420 

ATG ACG CTG CTC GAT CTT GAA A AT AGA GTT GCA (T T GCT CAA CGT CTT 
Met Thr Leu Leu Asp Leu Glu Asn Are Val Ala Leu AG G-r Are Leu 
42^ 4su 4^5 

TTT f c A ACG GA A GCA CAA GCT GCA ATG GAC, AG' . : \ w\\ MG AAGGGA 
Phe P^ Thr Giu Ala G n Ala Ala Me! ( 0 1 Le Aia ( ;!u Me: ! *si.h 
44<; 44 5 45 it 4 5 < 
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v a r.yr ac ttc a; \ gtg act agc ~ re g ag <"ct g ac cgt '. act got : - - 

[ hr tvs Glu Phc :,cGiTfir Scr Leu Giu Pro Asp Arg Lcj 1 nr Giy 
400 4":> 

A t' X"i AAG AGA ACA TCA CCG GGT GTA AAG A'i A GCA CCT TTC AG A ATC CTA I 55.< 

7 hr L>s Arg I hr Ser Pro G!> Va! Lys 11c Ala Pro Phe Arg lie Lcj 
a"5~ 48U 485 

G A A GAG CAT C A A ACT AGA CTA AAA GCG CTT TCT AAA ACC GTG G A A CTC 1001 
Glu Glu Mis Gin Ser Arc Leu Lys Ala Leu Sc L>s Thr Va! Giu Leu 
4^0 4'^ 500 

OGCi AAA CG A TTC TTC CCG CGC TGT TCG GCA GTG CTC G AC CAG ATT ATG ! o40 

G A Lvs Arc Phc Phe Pro Arc Cvs Scr Ala Val Leu Asd Gin lie Met 

>t 5 | • i 5 C 

AAC TGT GAG GAC TTG ACT C A A C TG GCT TGC GCjA GAA GAC GAC ACT GCT l<>97 
A'.n f'vs Glu Asp 1 cm Thr Gin Leu Ala ( As Gtv Glu Asp Asp Thr Ala 
5 2 n 525 53(" ! ^ 

(l AG AAA CGA CTA CAA AAG AAG CAA AGG TAC ATG GAA AT" A CAA oAU AGA I 745 

G'u I vs Art; [ eu Gin ! ys Lvs Gin Are Tvr Mc! Glu lie Gin Glu Thr 
540 545 550 

C [ a AAG AAG GCC TTT ACT GAG GAC A A T TTG GAA TTA GG A A AT TCG TCC I 793 

Leu Lys L\ s Ala Phc Scr Glu Asp Asn Leu Glu Leu Glv Asn Scr Ser 
55n 500 505 

GTG ACA GAT TCG ACT TCT TCC ACA TCG AAA TCA ACC GGT GG A AAG AGG 1841 
Leu Thr Asp Ser Thr Ser Ser Thr Ser Lys Ser Thr Giy Gly Lvs Arg 
5 70 5 75 5 SO 

TCT AAC CGT AAA CTC TCT CAT COT CGT CGG TG AG ACTCTT GCCTCTTAGT GTA 1894 
Ser Asn Arg Lys Leu Ser His Arg Arg Arg 
5S5 590 

ATTTTTGCTG T ACC A TATA A TTCTGTTTTC A PC ATG ACT G TAACTGTTTA TGTC1 ATCGT 
TGGCGTCATA TAGTTTCGCT CTTCGTTTTG CATCCTGTGT ATTATTGCTG CAGGTGTGCT 
TCAAACAAAT GTTGTAACAA TTTG A ACCA A TGGTATACAG ATTTGTA A'l A TATATTT ATG 
TA CATC A ACA A T A A A A A A A A A A A A A A A A A A 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 593 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY': linear 

In) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 3 

Me: Asp Thr Thr lie Asp Gly Phe Ala Asp Ser Tvr Giu lie Ser Se- 
5 'o 15 



1954 
20] 4 
2074 
2104 
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; hr ^er Phe V ui Ala Tir Asp Tnr A ^ ^ er ,ie Vj ^!- r Leu 

Aia Aia Giu v.l In \ j I L eu Thr Glv Pro Asp W. Ser Ala Le j G in Lee: 

;s 4o 4 5 

^eu be- Asn Scr Hho Glu Scr \ a I Phe Asp Ser Pro Asp Asp Phe Tyr 

s e r A A: a i vs Levi \ a I Leu ^er A >p t ■ ; > Arg Giu Vai Se r Phe His 

\ r ;(\? \ j| Leu Scr Aia Arg Scr Scr Phe Phe L\s Scr Ala Leu Ala 

Ata Ala Lvs Lvs Glu Lv- Asp Ser Asn Asn Thr Ala Ala Val Lys Leu 

i .ji, 105 11" 

Glu Leu Lvs Glu lie Aia Lvs Asp Tyr Glu Vai Giv Phe Asp Ser Val 

115' i:o 125 

\ a! Thr Val Leu Ala 'I yr Val I yr Scr Scr Arg Vai Arg Pro Pro Pro 

}\tj 135 I4i » 

L vs i.'iK ^ ji Ser Giu 0 ys Aia Asp Giu Asn Lys Cys His Vai Ala Cys 
,45 150 155 160 

Art' Hro Ala Vai Asp Phe Met Leu Glu Val Leu Tvr Leu Ala Phe lie 

! < > 5 : ~" ' 1 : " 5 

fhc L>s ; ie Pro Giu Leu lie Thr Leu T\ r Vjin Are H is Leu Leu Asp 

I it 1 185 I0i, 

\ a! Val Asp Lys Val \ al lie Glu Asp Thr Leu Val lie Leu Lys Leu 

195 200 205 

Ala Asn lie Cys GK Lvs Ala Gys Met Lys Leu Leu Asp Arc Cys Lys 

210 2! 5 220 

Glu He He Val I v? A^n V'-a' Asp Me' V.i! I eu Glu I vs Ser 
22 5 23<; 23 5 24d 

Leu Pro Glu Glu Leu Va! Lvs Glu lie He Asp A'g Arg Lvs Giu Leu 

245 25o 255 

Glv Leu G'.u Va! Pro Lys Val Lys Lvs His Val Ser Asn Val His Lys 

260 265 270 

Ala Leu Asp Scr Asp Asp He Glu Leu Val Lvs Leu Leu Leu Lys Glu 

2^5 280 2S5 

Asp His Thr Asn Leu Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala 

290 295 30f - 

'! vr Cys Asn Val Lys Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala 
305 310 ' 315 320 

Asp Va' Asn His Arg Asn Pro Arc G!y Tyr Thr Val Leu I0> Val Ala 

325 330 335 

Ala Me; Aig, Lv.-> Glu P:<~ Gin L<-u !!c L^u S<_; L<_u Glu Ly^ C0> 

340 345 350 

Ala Ser Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met lie 

3 5S 360- 365 

Ala Lvs Gin Ala Thr Me: Ala Va! G!u Cys Asn Asn He Pro Glu Gin 

3^1 3T5 380 

< V, L V: ; His Ser Leu Lys Glv Arg Leu Cvs Va! G!u lie Leu Glu Gin 
3?. 5 3 v Or 395 400 

Giu Asp Lvs Arg Glu Gin He Pro Arg Asp Vai Pro Pro Ser Phe Ala 

405 4 10 415 

\ ai Aia Ala Asp Glu Leu Lys Me! Thr Leu Leu Asp Leu Glu Asn Arg 

420 425 430 

V a; Ala Leu Ala Gin Arg Leu Phe Pro 'I hr Giu Ala Gin Ala Ala Met 

435 44o 445 

Giu He Ala Glu Met Lvs Glv Thr Cvs Glu Phe lie Val Thr Ser 1 eu 

4<ii 45s 400 

C'u P^> Asp Are Leu Thr Gly Thr Lvs Arg Thr Ser Pro Gly Val Lvs 

Le AG Pro Phe Art: ! ie I eu Glu Giu H ^ Gin Ser Arg Leu Lvs Aia 
40i 1 4 l )5 
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Leu Ser Lvs :h- V a; Cvu Leu ON L> s A^ ?he Phe Pre Arc C>s Scr 

«.;.(; s f ; S: : 1 

A: a \ j; Leu Asp G-n lie Vic: \^n( vs i'iIl Asp Leu Thr Gin Lea AIj 

5 ; ^ 5 2 v 5 2 5 

^ ';. s G!v G,j Asp Asp Thr Ala GI'j Lvs Ar: Leu Gin Lvs Lvs Gin Are 
sin 535 s 4 ; , 

7 '. r Me: G iu lie Gin G!u Thr Leu Lvs L v ^ Ala Phe ^cr Glu Asp Asn 
545 5 50 5 55 5f>(> 

Leu G!l Lcj GIv Asn Ser Ser Lc- Thr Asp Ser Thr Ser Ser Thr Ser 

V>5 5?f.) 5 7<; 

I . v s >e: ! h r G I > GIv L y s A re Ser A Arc Lvs Leu Ser ! h s Arg A 
^S ; » 5 8" 5 ^><> 

A'*' 



(2) INFORMATION FOR SEQ ID NO:4 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY': linear 

(id MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4. 

Asn His Arg Asn Pro Arg Gly Tyr Thr Val Leu His Val Ala Ala Met 
1 5 10 15 

Arg Lys Glu Pro Gin Leu He Leu Ser Leu Leu Glu Lys Gly Ala Ser 

20 25 30 

Ala Ser Glu Ala Thr Leu Glu Gly Arg Thr Ala Leu Met lie Ala Lys 
35 40 45 

Gin 



(2) INFORMATION FOR SEQ ID NO:5; 

(w SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 ammo acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY linear 

(n) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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Asn Ala Lvs Thr Lys Asn Gly Tyr Thr Ala Leu His Gin Aia Ala Gin 
1 5 10 1 5 

Gin Giy His Thr His He He Asn Vai Leu Leu Gin Asn Asn Ala Ser 

2U 25 30 

Pro Asn Glu Leu Thr \'al Asn Gly Asn Thr Ala Leu Ala He Ala Arg 
35 4(» 45 

Are 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 28 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY": linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO:6. 

I ys V'al 1 vs I vs His Yal Ser Asn Yal His Lys Ala Leu Asp Scr Asp 
1 5 10 15 

Asp He Glu Leu Yal Lys Leu Leu Leu Lys Glu Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE- ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY, linear 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO Y 

I .vs Thr I vs Asn G!y Leu Ser Pro Leu His Met Ala Thr Gin Gly Asp 
\ 5 10 15 

His Leu Asn Cys Yal Gin Leu Leu Leu Ser Arg Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO:8: 

a } SEQUENCE CHARACTERISTICS: 
( A ) LLNU 1 H : 3 3 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS 

Lys His Val Ser Asn Val His Lys Ala Leu Asp Scr Asp Asp He Ciiu 
1 5 :0 15 

Leu Val Lys Leu Leu Leu Lys Glu Asp His Thr Asn Leu Asp Asp Ala 

20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO:9: 

(i ) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9 

Asp Asp Ala Cys Ala Leu His Phe Ala Val Ala Tyr Cys Asn Val Lys 
15 10 15 

Thr Ala Thr Asp Leu Leu Lys Leu Asp Leu Ala Asp Val Asn His Arg 
20 25 30 

Asn 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l<> 



Arg Gly Tyr Thr Val Leu His Val Ala Ala Mel Arg Lys Glu Pro Gin 
15 10 15 
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Leu He Leu Ser Leu Leu Glu Lys Giy Ala Ser Ala Ser Glu Ala Thr 
2<> 25 30 

Leu 

<2) INFORMATION FOR SEQ ID NO: I I 

( i ) SEQUENCE CHAR-AC I ERISTICS 
(A) LENGTH: 33 amino acids 
f B } TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xn SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 : 

Glu Gly Arg Thr Ala Leu Met lie Ala Lys Gin Ala Thr Met Ala Val 
1 " 5 10 15 

Glu Cys Asn Asn He Pro Glu Gin Cys Lys His Ser Leu Lys Gly Arg 
' 20 25 30 

Leu 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION. SEQ ID NO: 12 

Gly Thr Pro Leu His Leu Ala Ala .Arg Gly His Val Glu Val Val Lys 
15 10 15 

Leu Leu Leu Asp Gly Ala Asp Val Asn Ala 1 hr Lys Ala lie Ser Gin 

20 25 30 

Asn Asn Leu Asp He Ala Glu Val Lys Asn Pro Asp Asp Val Lys Thr 

35 40 45 

Met Arg Gin Ser He Asn Glu 
5 ( ) 55 

(2) INFORMATION FOR SEQ ID NO. 13 
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(ti SEQUENCE CHARACTERISTICS. 
(A) LENGTH 21 "2 base pairs 
( B j TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

Mil MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO:13 

GTG ACTTTCT AAI.TA rGGCT GAAATTGCAG AACGAAAAAg ACTTIC C A GT TTTC ACTTG A <" > • J 

.\ t i . .\ a .\ i 7 \-A aaatggaaa: c \mcic\c- tgttcttct - : tttt act ac vgca rncc a t2<< 

I, )( ;rTr:r( C TCCTCTACCT TGCC 1 "AGCTC TITTCAATn CTAGAAl ATT CTTTTCT7AG I «0 

I GTG 7 A A TT -\ TCTaT AGC'.'G AATTTCTAAG At AGAACTTA TGTAAGGCGG CTTTCTGTA A 24o 

I GOAT A A TAG TAGGAGTGCG TTTTCTG ATI GG A A IGAGAT CAGCGGAAGC AGTaGTAT AT Vn: 

GCTGC A R/oG CGGCGGCATg AG I uAA IT I T iC IGGC GGGA GACTTCGgCg GCGGAGA JVA M«> 

fnCA'.'TG A A ACGCGTATGG GA \ A( AGTGG A ATGTA 7CTT GG A TGCGTCT TTG GCGG AG'T 420 

TTGACTACTT GGGGG ACGC1 A AC , ! TG I GO 7 i '1 <. GG( A C C GTGTAAG-jAA A TT <_ A GGT< ,< 4H.' 

A< TGGT GCAT TTTGTCGGGG AGGAGTGC C j T I ( '.'IT I A A ( r A A 7TTG TTGTGC G G T A A A A A G ( > S4o 

A I , A AG A AT AG I AG 'I AAGGTG OA \ TTG A AGO \GG rGATO A A A' i AGO ATG AG GTG aGCTA I'd ooo 

A TGGTGTA AT G AG TGTATTG GCTTATTTCT AT AGTGGTA A AGTI AGGCCT TCACCT A A AG* C,{>0 

A TG TGTGTGT TTGTGTGG AG A ATo" ACTGCT C FCATGTGGC TTGTAGGC C A GCTGTGGCA I 720 

TCCTGGTTGA GGTT'H GT AC AC A 1 GATTTA CC7TTGAGA r GTCTG A A TTG GTTGACAAGT 7?0 

TTGAGAGACA CCT ACTGG AT ATTGTTG AC A A A AGTGG AGC AG AC GATGTA A TG ATGGT7T 840 

TATCTGTTGC A A AG ATTTGT GGT'A A AGC AT GCG AG AGATT GCTTTG A AGC TGCATTgAGA 900 

TTATTGTCA A GTCTA ATGTT' GAT ATCATAA GCCTTGATA A AGCCTTGCCT GA TG AGaTTG 9Mi 

ja AAAGAAA T T AC TGATTCA CGAGCGGAAC TTGGTCTACA AGGgCCTGAA AGGA ACoGTI UOo 

TTOCTGA 7AA AC ATGTT A AG AGGATACATA GGGGATTGG A TTGTGATGAT GTTGA ATT AC 1080 

TATA A ATGTT GGTAAGAGAG GGGCATaGTA CCCTAGATGA TGGATATGC T CTCCATTATG I 140 

< TGTAGCG I A TTGCG ATGC A A AG ACTACAG GAG AAGTTGT AG ATCTTGC A <_ TTGc, TGATA^ i 200 
TTAATGATCA A A A TTG A A GC ) GGATACACGG TGCTGCATGT TCCAGCC ATG AGGA AAGAuC i260 
CTAAAATTGT AGTGTCCCTT Tr A AGGA A AG GAGCTAGAGC TTCTG ATCTG ACATCCGATG 1320 
GA AGAAAAGG ACTTCAAATC GC< AAGAGGG TCACTAGGCT TGTGGATTTC AGT A AGTCTC 1 JSO 

< GO AGGA AGO A A AATCTGGT TCGAATGATC GGTTATGC AT TG AC \ ATTCTG GAGCAAGCAG 1440 
A A AGAAGAGA CCCTCTGCT A GGaGAAGCTT CTGTATCTCT TGGT ATGOCA GGCG ATGATT I 500 
TG C'GTATG A A GGTGTTATAG CTTG A A A AT A G AGTTGGCCT GGGT A A ACTC GTTTTTGCA A lM»0 
TGGAAGCTAA AGTTGCAATG GACATTGCTC AAGTTGATGG CACTTCTGAG TTCCC:ACTGG 1020 
GTAGC ATCGG CAAAAAGATG GCTAATGCAC AGAGGACAAC AGTAGATTTG AACGAGGCTC loSO 
CTTTC A AG AT AAAAGAGGAG GACTTGAATC GGCTTAGAGC ACTCTGTAGA ACTgTAGAAC ! * ? 40 
TTCGAAAACG CTTCTTTCCA CGTTGTrCAG AAGTTGT AAA TAAG ATG ATG GATGLTGATg 1800 
ACTTGTCTGA GATAGCTTAC AT jGGG A ATG ATAGGGCaGA AGAGCGTCAA CTG A AG A AGC 1 KC>0 
AAAGGTAC^AT GGAAGTTGAA GAAATTCTGA GT A A A G GAT T G A C TCj A G G A 1 AAAGAAGAAT 1920 
ATG ATA AG AG TAACAACATG TCCTCATCTT GTTCCTGTAG ATGI'AAGGGA GTAGA IAAGG 19S0 
GGA A T A AGjGT CCCTTTT AGG AAA I AGO I A A ITG i A IT AGG AT A T A TG AG' . A AG A AG A GG A 2o4(» 

iTrrc ru. r a ac ataggagt ctttcctttc atcatttgat atgtgaagat agm acaaca 21 )o 

GGTGT AGG AT AA ACTTGTA V 1'GTTGC ACTT ACAACTTTGA A C j A A G AG A A T TT ATT I'CJ A A A 21G> 

A A A A A A A A A A A A - 1 2 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 588 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 
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([)) TOPOLOGY: linear 

Mi) MOLECULE TYPE: protein 

<xu SEQUENCE DESCRIPTION: SFO ID NO: 14: 

Met Asp Asn Ser Arg Thr Ala Phe Ser Asp Scr Asn Asp lie Ser Gly 
I s 10 15 

Ser Ser Ser He Cys Cys He Gly Gly Gly Met Thr Glu Phe Phe Ser 

">0 25 30 

Pro Glu~Thr Ser Pro Ala Glu He Thr Ser Leu Lys Arg Leu Ser Glu 

1> 40 45 

Thr Leu Glu Ser He Phe Asp Ala Ser Leu Pro olu Phe Asp Tyr Phe 
50 55 60 

-^.p A i a l vs Leu Val Val Scr lily Pro Cys Lys Glu lie Pro va. 

6s' ' ' "5 SO 

His Arg Cvs He Leu Ser Ala Arg Ser Pro Phe Phe Lys Asn Leu Phe 

8^ 90 95 

Cvs Glv Lvs Lvs Glu Lys Asn Ser Ser Lys Val Glu Leu Lys Glu Val 

"lOO 105 HO 

Met Lvs Glu His Glu Val Ser Tyr Asp Ala V al Met Ser \ ai ueu Ala 

US 120 125 

Tyr Leu Tvr Ser Glv Lys Val Arg Pro Ser Pro Lys Asp Val Cys Val 

'no ' 135 140 
Cvs Val Asp Asn Asp Cys Ser His Val Ala Cys Arg Pro Ala Val A.a 
145 150 155 160 

Phe Leu Val Glu Val Leu Tyr Thr Ser Phe Thr Phe Gin lie Ser Glu 

165 170 175 

Leu Val ^sp Lvs Phe Gin Arg His Leu Leu Asp He Leu Asp Lys Thr 

180 ' 185 190 

Ala Ala Asp Asp Val Met Mei v ai * <»' > *•>•• --- - - - 

195 200 205 

Lys Ala Cvs Glu .Arg Leu Leu Ser Ser Lys He Glu He lie Val Lys 

'-in ' 215 220 
ser Asn Val Asp He He Thr Leu Asp Lys Ala Leu Pro His Asp lie 



ilv ?40 



Val Lvs Gin He Thr Asp Ser Arg Ala Glu Leu Gly Leu Gin Gly Pro 

745 250 255 

Glu Ser Asn Gly Phe Pro Asp Lys His Val Lvs .Arg He His Arg Ala 

^60 265 270 

Leu Asp Ser Asp Asp Val Glu Leu Leu Gin Met Leu Leu Arg Glu Gly 

->^5 280 2S5 

His Thr Thr Leu Asp Asp Ala Tyr Ala Leu His Tyr Ala Val Ala Tyr 
*K)i ) 2 l >5 300 
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C>s Asp Ala Lys Thr Thr Ala Glu Leu Lea Asp Leu Ala Leu Ala Asp 

305 3 10 3 15 32" 

He Asn His Gin Asp. Ser Arg Giv Tyr Thr \ ai Leu His Ya! Ala Ala 

325 330 335 

Met Arg Lys Glu Pro Lys He Yal Yal Ser Leu Leu Thr Lys Glv Ala 

340 345 350 

Arg Pro Ser Asp Leu Thr Ser Asp Glv Arg Lys Ala Leu Gin He Ala 

35 5 360 365 

Lys Arg Leu Thr Arg Leu Yal Asp Phe Ser Lys Ser Pro Glu Glu Gly 

?"0 375 380 

Lys Ser Ala Ser Asn Asp .Arg Leu Cys lie Glu He Leu Glu Gin Ala 
3S5 39o 395 400 

Glu Arg Arg Asp Pro Leu Leu Gly Glu Ala Scr Yal Ser Leu Aia Met 

405 410 415 

Ala Gly Asp Asp Leu Arg Met Lys Leu Leu Tyr Leu Giu Asn Arg Yal 

420 425 430 

Gly Leu Ala Lys Leu Leu Phe Pro Viet Glu Ala Lys Yal Aia Met Asp 

435 440 445 

lie Ala Gin Yal Asp Gly Thr Ser Glu Phe Pro Leu Ala Ser He Gly 

450 455 460 

Lys Lys Met Ala Asn Ala Gin .Arg Thr Thr Yal Asp Leu Asn Glu Ala 
465 470 475 480 

Pro Phe Lys lie Lys Glu Glu His Leu Asn Arg Leu Arg Ala Leu Ser 

485 490 495 

Arg Thr Yal Glu Leu Gly Lys Arg Phe Phe Pro Arg Cys Ser Glu Val 

500 505 510 

Leu Asn Lys He Met Asp Ala Asp Asp Leu Ser Glu He Ala Tyr Met 

515 520 525 

Gly Asn Asp Thr Ala Glu Glu .Arg Gin Leu Lys Lys Gin Arg Tyr Met 

530 535 540 

Glu Leu Gin Glu lie Leu Thr Lys Ala Phe Thr Glu Asp Lys Glu Glu 
545 550 555 560 

Tyr Asp Lys Thr Asn Asn He Ser Ser Ser Cys Ser Ser Thr Ser Lys 

565 570 575 

Gly Yal Asp Lys Pro Asn Lys Leu Pro Phe Arg Lys 
580 585 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS 
( A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
GTGACAGACT TGCTCCTACT G 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(n) MOLECl.'LE TYPE: DNA 

(\i) SEQUENCE DESCRIPTION SEQ ID NO:l0: 
C AGTGTGTAT CAAAGCACCA 

(2) INFORMATION FOR SLQ ID NO: 1 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 17: 
TTCTCCAGAC CACATGATTA T 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA 

(\i) SEQUENCE DESCRIPTION. SEQ ID NO IS 



WO 98 06748 



PCTX S971J994 



-Si » 

TGAAGCTAAT ATGCACAGGA G - I 

«2) INFORMATION FOR SEQ ID NOiv 

(i) SEQUENCE CHARACTERISTICS 
( A) LENGTH: 21 base pairs 
(B ) TYPE: nucleic acid 
(C> STRAND EDNESS: single 
( D ) TOPOLOGY: linear 

CD MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 19 

GTAGGTGCTC TTGTTCTTCC C 21 

(2) INFORMATION FOR SEQ ID NO 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 
!D) TOPOLOGY: linear 

«ii) MOLECULE TYPE:DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 

CACAT AATTC CCACG AGGAT C 2 1 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH. 17 amino acids 
( B i TYPE: ammo acid 
(O STRANDEDNESS: single 

(D) TOPOLOGY linear 

in) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 21 

Met Lys Giy Thr Cys Glu Phe lie Val Thr Ser Leu Glu Pro Asp Arg 
1 5 10 1? 

Leu 
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(21 INFORMATION FOR SEQ ID NO 22 

(it SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

in) MOLECULE TYPE: protein 

(X!) SEQUENCE DESCRIPTION: SEQ ID NO 22 

Arg Arg Lvs Glu Leu Gly Leu Glu Val Pro Lys Yal Lys Lys 
! 5 i ' 1 

(2) INFORMATION FOR SEQ ID NO:23: 

(0 SEQUENCE CHARACTERISTICS: 
I A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Lys Lys Gin Arg Tyr Viet Glu He Gin Glu Thr Leu I ys Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:24: 

.1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 7 base pairs 

(B) TYPE: nucleic acid 

(L) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

in) MOLECULE TYPE: DNA 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO:24 
AAR( i AR» iAYC AYACNAA 



1 7 
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ll) INFORMATION FOR SEQ ID NO:2> 

ID SEQUENCE CHARACTERISTICS: 
(A) LENGTH: ] 7 base pairs 
I B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(K.) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TAYGTYAAYG TNAARAC 

<2> INFORMATION FOR SEQ ID NO:26 

(i) SEQUENCE CHARACTERISTIC'S. 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

UU MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.26: 
GCCATNGTNG CYTGYTT 

(2) INFORMATION FOR SEQ ID NO: 2" 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C> STRANDEDNESS: single 
(D) TOPOLOGY linear 

(i;t MOLECULE TYPE: DNA 

i XI ) SEQUENCE DESCRIPTION: SEQ ID NO:27 

AARGTN AARA ARCAYGT 

(2) INFORMATION FOR SEQ ID NO:28: 

ID SEQUENCE CHARACTERISTICS 
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( A) LENGTH : 21 base pairs 
( B) TYPE: nucleic acid 
(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

.11) MOLECULE D N A 
<xi) SEQUENCE DESCRIPTION SEQ ID NO:2S: 

RAAYTCRCAN GTNCCYT I CA T 
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Claims 

1 An isolated nucleic acid molecule comprising a sequence encoding an acquired 
resistance polypeptide, wherein said acquired resistance polypeptide ,s capable of conferring, 
on a plant expressing said polypeptide, resistance to a plant pathogen 

2. The isolated nucleic acid molecule of claim 1 . wherein said polypeptide is capable 
of mediating the expression of a pathogenesis-related polypeptide. 

3 The isolated nucleic acid molecule of claim i. wherein said polypeptide comprises 
an ankyrin-repeat motif. 

4. The isolated nucleic acid molecule of claim 1. wherein said polypeptide is obtained 
from an angiosperm. 

5. The isolated nucleic acid molecule of claim 4. wherein said angiosperm is a 
member of the Solanaceae or the Cruciferae. 

6 The isolated nucleic acid molecule of claim 1 , wherein said nucleic acid molecule 
is genomic DNA or cDNA. 

aid plant nathoeen is a 
I he isoiaieu nucleic a<-iu moiccui^ v-iJi.i. ...... ...i- i 

bacterium, vims, viroid. fungus, nematode, or insect. 

8 An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
that specifically hybridizes to a nucleic acid molecule comprising the genomic nucleic acid 
sequence of Fig. 4 (SEQ IDNO:l). 

9. An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
thai specifically hvbridizes to a nucleic acid molecule comprising the cDNA ofl-ig. 5 (SEQ 
ID NO:2). 
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in An isolated nucleic acid molecule that encodes an acquired resistance polypeptide 
that specifically hybridizes to a nucleic acid molecule comprising CDNA sequence of Fig \A 
(SEQ ID NO:13) 

5 11. The isolated nucleic acid molecule of claims 8- lu, wherein sa]d nucleic acid 

molecule encodes a polypeptide that mediates the expression of a pathogenesis-related 
polypeptide 

12 The isolated nucleic acid molecule of claims 8-10. wherein said nucleic acid 
10 molecule encodes a polypeptide comprising an ankvnn-repcat motif 

13. The isolated nucleic acid molecule of claims 1 or X-l (), wherein said nucleic acid 
molecule is operably linked to an expression control region 

15 14. A vector comprising the nucleic acid molecule of claims 1 or 8-10, said vector 

beine capable of directing expression of the polypeptide encoded by said nucleic acid 
molecule. 

1 5. A cell comprising an isolated nucleic acid molecule of claims 1 , 8-1 0, or 14. 

20 

16. The cell of claim 15, wherein said cell is a plant cell. 

1 T The eel! of claim 1 5, wherein said cell is a bacterial eel!. 

25 I S. The cell of claim 1 7, wherein said bacterial cell is Agrohactenum. 

19. The cell of claim 16. wherein said plant cell has increased resistance to a plant 
pathogen, 

30 20. A transgenic plant composing a nucleic acid molecule of claim 1, 8-10, or 14, 
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\\ herein said nueieic acid molecule is expressed :n said transgenic plant 

21. The transgenic plant of claim 2<>, wherein said transgenic plant is an angiosperm. 

22. The transgenic plant of claim 2tK wherein said transgenic angiosperni is a 
monocot or a dicot. 

23 The transgenic plant of claim 20, wherein said dicot is a cruciferous plant or a 
soianaceous plant 

24. A seed from a transgenic plant of claim 20 
25 A cell from a transgenic plant of claim 20. 

26. A substantially pure acquired resistance polypeptide including an ammo acid 
sequence that has at least 40% identity to the amino acid sequence of Fig. 5 (SEQ ID NO;3) 
or Fig. 7B (SEQ ID NO: 14). 

2 7 . The substantially pure polypeptide of claim 26, wherein said polypeptide is 
capable of mediating the expression of a pathogenests-related polypeptide. 

25. The substantially pure polypeptide of claim 26, wherein said polypeptide includes 
an ankvnn-repeat motif or a G-protein coupled receptor motif 

2v>. 1 he substantially pure polypeptide of claim 26. therein said polypeptide is 
obtained from an angiosperm. 

30. The substantially pure polypeptide of claim 2 ( >, wherein said angiosperm is a 
member of the Solanaccae or the Cruciferae. 
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3 1 A method of producing an acquired resistance nohpeoiide, said method 
comprising the steps of: 

(a) providing a cell transformed with a nucleic acid molecule of claims 1 or 
S-10 positioned for expression ;n the cell. 

5 (b) cultunng the transformed ceil under conditions for expressing the nucleic acid 

molecule; and 

(c) recovering the acquired resistance polypeptide 

32 A recombinant acquired resistance polypeptide produced by the method of claim 

10 31. 

33. A substantially pure antibody that specifically recognizes and binds to an 
acquired resistance polypeptide or a portion thereof. 

1 5 34 The substantially pure antibody of claim 33, wherein said antibody recognizes and 

binds to a recombinant acquired resistance polypeptide or a portion thereof. 

35 . A method of providing an increased level of resistance against a disease caused 
bv a plant pathogen in a transgenic plant, said method comprising the steps of 

20 (a) producing a transgenic plant cell including the nucleic acid molecule of claims 1 

or 8-10, wherein said nucleic acid is positioned for expression in the plant cell; and 

(b) growing a transgenic plant from the plant cell wherein the nucleic acid molecule is 
expressed in the transgenic plant and the transgenic plant is thereby provided with an 
increased level of resistance against a disease caused b\ a plant pathogen. 

25 

36. The method of claim 35. wherein said plant pathogen is a bacterium, virus, viroid. 
fungus, nematode, or insect. 

37. The method of claim 35, wherein said plant pathogen is Phvtophthora. 
30 Pcroriospora, or Pseudomonas 
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A method of isolating an acquired resistance gene or fragment thereof, said 
method comprising the steps of; 

(a) contacting the nucleic acid molecule of Fig 4 (SEQ ID \Oi ), Fig. 5 (SEQ ID 
? NO;]), or F:g. ~A (SEQ ID NO:l 3) or a portion thereof with a preparation of DNA from a 

plan.: cell under hybridization conditions providing detection of DNA sequences having at 
least 40 u n or greater sequence identity to the nucleic acid sequence of Fig. 4 (SEQ ID NO: 1 >. 
Fig 5 (SEQ ID NO:2h or Fig. 7A (SEQ ID NO:13): and 

(b) isolating said hybridizing DNA 

10 

3'/. A method of isolating an acquired resistance gene or fragment thereot. said 
method comprising the steps of: 

(a) providing a sample of plant cell DNA; 

(b) prov iding a pair of oligonucleotides having sequence identitv to a region of the 
15 nucleic acid of Fig. 4 (SEQ ID NO: 1 ), Fig. 5 (SEQ ID NO:2), or Fig. 7A (SEQ ID NO:13); 

(c) contacting the pair of oligonucleotides with said plant cell DNA under conditions 
suitable for polymerase chain reaction-mediated DNA amplification; and 

(d) isolating the amplified acquired resistance gene or fragment thereof. 

2n 40 The method of claim 3 C >, wherein said amplification step is carried out using a 

sample of cDNA prepared from a plant cell. 

41 The method of claim 30, wherein said pair of oligonucleotides are based on a 
sequence encoding an acquired resistance polypeptide, wherein the acquired resistance 
2:^ polypeptide is at least 40% identical to the amino acid sequence of Fig. 5 (SEQ ID NO:3 ) or 

Fig 7B (SEQ ID NO:14). 
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10 2 0 3 0 4 0 5 0 

• • • * • 

A-AGCTTGTGA TGCAAGTCAT GGGATATTGC TTTGTGTTAA GTATACAAAA 

TTCGAACACT ACGTTCAGTA 33CTATAACG AAACACAATT CATATGTTTT 

6 0 7 0 SO 9 0 100 

• * » « • 

CCATCACGTG GATACATAGT OTTCAAACCA AC C ACT AAA 3 AGTATCAGGT 

3GTA. GTGCAC 3TATGTA.TCA 0AAGTTT3GT TGGTGATi. . _j i _A*AGTC^_A 

110 120 130 140 ISO 

OATACCAAAG CCAGAAGTGA AGGGT7GGGA TATGTCATTG GGTTTAGCGG 

3TAT3GTTTC 33TCTTCACT TCCCAACCCT ATACAGTAA2 C2AAATCGCC 

16 0 172 IS 2 19 0 20C 

• * • » » 

TAATCGGATT GAACCCTTTC CGGTATAAAA TACAAAGGCT TTC3CAGTCT 

ATT AOOCTA.^ 0TTGGGAA-A3 G2CATATTTT ATGTTTCCGA A_AG>~GTCAvjA 



z 4 3 2 5 0 



0GGC 3TATGT GTATGTCTCG GGGTATCTAC CATTTGAATC ACAGAACTTT 

GCCGC AT AC A CATACAGAGC CCCATAGATG GTAAACTTAG 7GTCTTGAAA 

260 270 280 29 0 300 

TATGTGCGAA GTTTTCGATT "TGATTCOTT TA"TGGAAG AG ATT AG AAA 

ATACACGCTT C AAAA G C T AA GACTAAGCAA ATGGACCTTC TCTAATCTTT 

3 10 3 2 0 3 3 J 3 4 3 3 5 0 
*«••*•» 

TTTGCGTCTA CCAAAAACAG AO AG ATTAAT TTTTTCCAA 7 C C G A T AC AAG 

AAACGCAGAT GGTTTTTGTC TGTCTAATTA AAAAAGGTTG GGCTATGTTC 



3 6 0 3^0 333 390 



*4 sj W 



TTTCGGGGTT CTTGCATTGG ATATCACGGA ACAACAATGT GATCCGGTTT 
AAAGCCCCAA GAACGTAACC TATAGTG OCT TGTTGTTACA CTAGGCC AAA 



4 10 42 0 43 0 440 



TGTCTCAAAA CCGAAACTTG GTCCTTCTTC CATACTCCGA ACTCTGATGT 
ACA 3AGTTTT GGCTTTGAA. C CAGGA.AGAAG GTATGAGGCT TGAGACTACA 



46C 47 2 490 490 



=: r. o 



7TTCTCAGGA TTAGTCAGAT ACGAAGC3AA GCTAGGTGCT ATTCGTCAGT 

A.AAGAGTCCT AATCAGTCTA TGCTTCCCTT CGATCCACGA TAAGCA 3TC A 

510 520 53 : 540 550 

* # * * * 

GGACAAACAA AGATCAAGA.A GATGTTCAC3 AGTTATGGGT TTTAAAGAGC 
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CTACAAGTG _ TCAATA^CCn. AAATT1*CT*CG 
53 0 5 9 0 6CC 

TAAAGTGAAA gatattaaaa gcattggagt 

TCAAAACTTT TCAGCACCCA ATTTCACTTT CTATAATTTT CGTAACCTCA 

61C 62 0 63 0 640 650 

• • • • • 

AGATTTGATT ACGTGGACTC CAAGCAACGA CGTTGTATTG TTTCGTAGTA 
TCTAAACTAA TGCACCTGAG GTTCGTTGCT GCAACATAA_ AAAGCATCAT 

660 670 690 690 700 



GTGATCG^GG TTGCCTCTAC AACATAAACG CAGAGAA^TT ^_»AATTTAGTT 

CAC.AGCA~„ AA-— — 'GAGATG TTGTATTTGC GTOTCTTlaA TAAATCAA 

710 72C 730 740 750 

* ♦ * * * 

TATGCAAAAA AAGAG'GGATC TGATTGTTCT TTCGTTTGTT TTCCGTTTTG 

AT ACGTTTTC TTCTCCCT AG ACT AA C AAG A AAGCAAAC AA. AAGGCAAAAC 

76 0 77C 780 790 80 0 

TTCTGA7TAC GAGAGGGTTG ATCTGAACGG AAGAAGCAAC GGGCCGACAC 

AAGACTAATG CTCTCCCAA.C TAGACTTGCC TTCTTCGTTG CCCGGCTGTG 

810 820 83C 840 850 

» * » * * 

TTTAAAAAAA AAATAAAAAA AATGGGCCGA CAAATGCAAA CGTAGTTGAC 

AAATTTTTTT TTTATTTTTT TTACCCGGCT GTTTACGTTT GCATCAACTG 

860 870 880 890 900 

* • * * • 

.AAGGATCTCA AGTGTCAAGT CTCAATTGGC TCGCTCATTG TGGGGCATAA 

TTCCTAGAGT TCAGAGTTCA GAGTTAACCG AGCGAGTAAC AGCCCGTATT 

910 920 93 0 940 950 

• * * • * 

ATATATCTAG TGATGTTTAA TTGTTTTTTA TAAGGTAAAA AGGAATATTG 

TATATAGATC ACTACAAAT7 AACAAAAAAT ATTCCATTTT TCCTTATAAC 

960 97Q 99: 990 100C 

• * * • • 

AATTTTGTTT TTTAGGTTTA TGTAATAATA CCAAACATTG TTTTATGAAT 

TT AAAAC AAA GAATC C AAA T ALA . TA * . A * ^> ^ * . j ± AAC /-AAA » AC * T A 

1010 102C 103 0 1C40 105C 
***** 

ATTTAATCTG AT TTTTTGGC TAGTTATTTT ATTATATCAA GGGTTCCTGT 

T AAA TT AG A C TAAAAAACCG ATCAATAAAA 7AATATAGTT C CCAAGGACA 

1060 1070 1030 1090 1100 



F,'(r. f 
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77ATAGTTGA AAAC AGTTAC 7G7A7AGAAA A7AG7G7GGC AATTTTCTCT 
AATATCAAC7 TTTGTCAATG ACATA7CTTT 7A7CACAGGG 77AAAAGAGA 



1110 
* 

C7TAAA7AA7 
GAATTTATTA 

nee 

AATAATA7C7 
77ATTATAGA 

1210 

ATTGTTTAC A 
TAACAAATGT 



1120 
« 

ATATTAGTTA 
TA7AATCAA7 

1170 

AAAGCAACAC 
7T7CG77GTG 

1220 

7ATA777A7A 
ATATAAATAT 



1130 

ATAAAAGATA 
7A777TCTA7 

119 0 

A7A77TAGAC 
7A7AAA7CTG 

1230 

GCTTACCAAT 
CGAATGGTTA 



1140 
*• 

TTTTAATATA 
AAAA7TATAT 

1190 

ACAACACG7A 
7G7TGTGCA7 

1240 

ATAACCCGTA 
TATTGGGCAT 



1150 
* 

TTAGATATAC 
AATCTATATG 

1200 

ATATCTTACT 
TA7AGAATGA 

1250 

TC7ATGTTTT 
AGATACAAAA 



1260 

ATAAGGTTTT 
7AT7CGAAAA 



i n n n 

A. «. i w 

ATACAATATA 
TATG7TATAT 



1280 
* 

TGTACGGTAT 
ACATGCCATA 



12 90 

GCTGTCCAGG 
CGACAGG7GC 



13 00 

7A7ATATATT 
ATATATATAA 



1310 

CTCCAAAAAA 
GAGG71 U I U 7TT 

1360 

GGTG7TTATC 
C C AC AAAT AG 

1410 

GAAGATAAAA 

v 4 ± GT A *. A * *■ 

14 6 0 
* 

CATTTTAAAA 
GTAAAAT7T7 

1510 

GATAATTAGT 
CTATTAATCA 

156C 

77AT7G7AAA 
AA7AAG ATTT 



1320 

AACGCATGGT 
TTGCGTACCA 

1370 

TAAAGTTTAT 
ATTTCAAATA 

1420 

AAATTATATC 
7T7AATATAG 

1470 

AATTAATTAA 
TTAA7TAATT 

1 c ~> o 
A. J <i w 

AAAA7TAATT 
7T7TAATTAA 

1570 

777AG7TAAA 
AAA7GAA777 



1330 

ACACAAAA77 
7G7G7T7TAA 

1380 

CACAATATTT 
GTG77A7AAA 

1430 

AGA77GA7TC 
7 C 7 AA. C 7 AA G 

1480 

AAGAAAAC7A 
77C77TTGA7 

1 c t n 
j j ^ 

AAA7ATGTGA 
TT7A7ACACT 

1580 

A7CA7ACAAA 
TAG7A7G7TT 



1340 

7 ATT AAAT A7 
ATAAT77A7A 

1390 

ATCAACTATA 
TAGTTGATAT 

1440 

AA7TAAATTT 
77 AA 777 AAA 

1490 

777CATAAAA 
AAAG7A7T77 

15 4 0 

7GCTAT7GAA 
ACGATAAG7T 

1 5 9 C 

7C7TA7CG7A 
AGAA7AGG A 7 



1350 

T7GGCAATTG 
AACCGTTAAC 

1400 

ATAGA7GGTA 
TA7CTACCAT 

1450 

TATAATATAT 
A7AT7ATATA 

15 00 

77GTTCAAAA 
AACAAGTTTT 

15 5 0 

77ATAGAGAG 
AATATCTCTC 

160C 

A777AACTTA 
7AAATTGAAT 



; - 4 o 
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CC77A7TA7A AC77C7A7A7 AAAGTACTCT GTT7A7TCAA CA7AA7CTTA 

GGAATAA7A7 7GAGGA7A7A TTTCATGAGA CAAATAAGTT GTAT7AGAA7 

1710 1"23 1733 174C 1750 

* * * * • 

CG77G77G7A 77CATAGGCA 7CTTTAACCT ATCTTTTCAT TTT77GATC7 

'-j G AA A-r^C/- 1 . i .--A -j i A . C. G . A'oAAA - TGGA 7A.GAAAAG7A AAAGAC7AGA 

176: 1~70 1730 1790 190C 

* * » • , 

CGA7CG77T7 2GATCCAA2A AAATGAGTCT ACCGGTGAGG AACCAAGAGG 

GC7AGCAAAA 3C7AGG77G7 777AC7CAGA 7GG7CAC7CC 77GG77C7CC 

131 G 132 G 133 0 1840 1950 

* * * w • 

7GA77A7GCA GATTCC777T TG77C7GAG7 77CGA3CAAC ATCGAGTCCG 

ACTAA.7A2GT C7AAGGAAGA. AGAAGAG7 OA AAGG7CG77G TAGCTCAGGC 

1360 13-0 135 0 139 0 1900 

* # ♦ • . 

GAAAAGAOGA ATGAAGTGAA GGA7GAGCCA AATT7G7TTA GACGTG77AT 

C7777GTGG7 7AG77CAC77 CCTAC7CGGT 7 7 AAA 2 AAA 7 CTGCACAATA 

1910 192 0 1930 1940 1950 

GAA777GC77 77ACG7CG7A GTTATTGAAA AAGC7GA7T7 ATGGCATGA7 

C77AAACGAA AATGCAGCA7 GAA7AAC7T7 77 CG ACT AAA 7A.GCG7ACTA 

I96 0 19^0 1930 1990 200 0 

7CAGAACGAG AAG77GAAGG CAAATAAG7A AAGAAG7C77 77A7A7G7AT 

AG7C77GCT 0 77CAAG77CC G777A77GA7 77C77 OAGAA AA7A7ACA7A 

2 0i: 2 02 0 203 0 2040 205C 

ACAATAATTO 77777AAA7C AAA7CCTAA7 7AAAAAAATA 7ATTCA77A7 

7GT7A7TAAC AAAAAT77AG T77AGGAT7A A77777T7AT ATAAGTAATA 

2060 23-0 2030 2090 J100 



uA^TTTCAi'j 77777AA7G7 AATT7A7TCC 7A7A7 27A7A A7GA.77777G 
-TOAAAO . A . AAAAA77ACA 77AAA7AAGG A 7 A T A' _» A 7 A 7 7 A 77 AAAAAC 

2110 212 0 2130 2140 2150 

»*•*»» 

77G7GAAGAG CG7TTTGA77 TGC7A7AGAA CAAGGAGAA7 AG77CCAGGA 
AAC AC7TCT 2 G C A AAA G 7 A A ACGA7ATC77 G77C07G77A 7CAAGG7CC7 
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AATATTCGAC TTGATTTAAT TATAGTGTAA ACATGCTGAA CACTGAAAA? 

TTATA-AGCTG AACTAAATTA ATATCACATT TGTACGACTT GTGACTTTTA 

2 213 2223 2233 2243 225 3 

TACTTTTTCA ATAAACGAAA AATATAATAT AC ATT AC AAA ACTTATGTGA 

ATGAAAAAGT TATTTGCTTT TTATATTATA TGTAATGTTT TGAATACACT 

2263 C2 "T 3 22S3 22 9 3 2330 



TATTTCGTAC TCTGAA-TTAT ATGCAAGGGA AlATAGTAAAlA. TGA^AGTTTCT 

2310 2323 233C 2340 2350 

AAATAAACAG AAATGTAACT TTCACATGTA AATCTAATTC TTAAATTTAA 

TTTATTTGTC TTTACATTGA AAGTGTACAT TTAGATTAAG AATTTAAATT 

2 3 6 0 2 3 7 C 2 3 8 C 2 3 9 0 2 4 C C 

AAAATAATAT TTATATATTT A T A T G AAAA T AACGAACCGG ATGAAAAATA 

TTTTATTATA AATATATAAJa TATACTTTTA TTG 2TTGGCC TACTTTTTAT 

241 C 242 0 2430 2440 2450 

AATTTTATAT ATTTATATCA TCTCCAAATC TAGTTTGGTT CAGGGGCTTA 

TTAAAATATA TAAATATAGT AGAGGTTTAG ATC.AAACCAA GTCCCCGAAT 

2460 2470 2480 2490 2500 

* # * » » 

CCGAACCGGA TTGAACTTCT CATATACAAA AATTAGCAAC ACAAAATGTC 

GGCTTGGCCT AACTTGAAGA GTATATGTTT TTAATCGTTG TGTTTTACAG 

2510 2520 2530 2540 2550 

* » » » • 

TCCGGTATAA ATACTAACAT TTATAACCCG AACC GGTTTA GCTTCCTGTT 

AGGCC ATATT TATGATTGTA AATATTGGGC TTGGCCAAAT CGAAGGACAA 

256C 2570 25S0 259 0 26 00 

ATATCTTTTT AAAAAAGATC TCTGACAAAG ATTCCTTTCC TGGAAATTTA 

TATAGAAAAA TTTTTTCTAG AGACTGTTTC TAAGGAAAGG ACCTTTAAAT 

2 610 2 5 2 0 2 5 3 0 2 5 4 0 2 6 5 0 



CCGGTTTTGG TGAAATGTAA A2CGTGGGA2 GA^GATGCT. 3T'. r 3A I'AT3". 

G G C 2 AlAAAC C AOT i TA0A7T T G G C A 2 C 7 TG C T 7 3 T AC C A.A 2 AA 37AT A.G A. 

266C 2670 263: 269 0 2 7 0 0 

* • * • . 

CAC2ACCACT CTCGTTGACT GGACTTGG7T CTG 2TCGTCA ATGGTTATCT 

GTGGTGGTGA GAGCAACTGA C 2 T 3 AlA C C 2- A. GAC3JAGCAGT TACCAATAGA 
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272: : ^ 3 : 274 c 275 c 

» • * * • 

7CGATCTTAA AGCAAATGCA GTTGATAAGG 7CTCTTCGTT GATTAGGAGA 
AGCTAGAATT TGGT7TAGG7 GAACTA77CC AGAGAAGCAA CTAATGGTG7 

276C 277C 2780 279C 26CC 

• * • * • 

GATCTCTTTA ATTTGTGAAT TTCAATTCA7 CGGAACCTGT TGATGGACAC 
CTAGAGAAAT TAAACACTTA AAGTTAAG7A GCC7TGGACA ACTAGCTGTG 

231C 282C 2830 2840 2 8 5 0 

» * » * * 

CACCA77GA7 GGATTCGCCG A77777A7GA AATCAGCAGC ACTAGTTTCG 
G7GG7AAC7A CCTAAGCGGO 7AAGAATAC7 - .Ao.Cu.Cj * jn. . CAAAG _ 

2 36 0 2872 2830 2390 2900 

v • * * • 

TCGCTACCGA TAACACCGAC TGC7C7AT7G TTTATCTGGC CGCCGAACAA 
AGCGA7GGG7 ATTGTGGCT j AGGAGATAAC AAATAGACCG GCGGC7TGT7 

2 91 0 292: 293 0 294C 2 9 5 0 

G7AC7CACCG GACC7GATG7 ATCTGCTCTG CAATTGCTCT CCAACAGCTT 
CATGAGTGGC CTGGACTACA 7AGA2GAGAC GTTAACGAGA GG7TGTGGAA 

2960 297C 2980 2990 3000 

* • • * 

CGAATCCGTC 7TTGAC7CGC CGGATGATTT CTACAGCGAC GCTAAGCTTG 
GCTTAGGCAG AAACTGAGGG GCCTACTAAA GATGTCGCTG CGATTCGAAC 

3010 3020 3030 3040 3050 

TTCTCTCCGA CGGCGGGGAA GTTTCTTTCC ACCGGTGGGT TTTGTCAGCG 
AAGAGAGGG7 GCCGGCZC^Z CAAAGAAAGG TGGCC ACGCA AAACAGTCGC 

3060 3070 3C80 3090 310C 

* . • * 

AGAAGCTCTT TCTTCAAGAG CGCTTTAGCC GCCGCTAAGA AGGAGAAAGA 
TCTTGGAGAA AGAAGTTCTC GCGAAATCGG CGGCGATTCT TCCTCTTTCT 

3 110 3 120 3130 3 140 3150 

• • * * * 

CTCCAACAAC AGGGCCGCCG TGAAG2TCGA GC7TAAGGAG ATTGCCAAGG 
GAGGTTGTTG TGGCGGCGGC ACT7CGAGCT CGAATTCCTC TAACGGTTCC 

3160 31^0 3180 3190 3200 

ATTACGAAGT CGGTTTCGAT 7GGG77G7GA C7GT777GGG TTATGTTTAC 
TAA7GCTTGA GCCAAAGCTA AGCCAACAC7 GACAAAACGG AATACAAATG 

3210 32 2 0 3 2 30 3 2 40 3 25 0 

***** 

AGCAG7AGAG TGAGACCGCC G 7 C 7 AAA GG A G777C7GAA7 GGGGAGACGA 



WO 98/06* T 48 1 0/34 



7CG70G7C7C AC7C7GGCGG CGGA777CC7 CAAAGAC77A 

3263 3270 3 2 9 0 3293 

* • • * 

GAA7TGC7GC C ACG7GGC77 GCCGGCCGGC GG7GGA7T7C 

C 7TAACG Av_ oTGv_ A - G AA. C jGv, w GG CCo v_ „ .-a - ^_ 7.rvA_rv^j 

33 i: 3 32C 3 3 3 C 3340 

77C7C7AT77 GGC77TCA7C 77CAAGA7CC C7GAA77AAT 

AAGAGA7AAA CCGAAAGTAG AAG7TC7AGG GACT7AATTA 

3360 3 37 C 3330 339C 

CAGGTAAAAC AGGATCTGCA 7TAAGC7A7G G77ACACATT 

G7CCA777TG TGGTAGACGT AATTCGATAC CAATG7GTAA 

3410 342C 343 0 3440 

* * » » 

TTC77ACTTG AGTACTTGTA TTTG7ATT7C AGAGGCAGTT 

AAGAA7GAAC TCATGAACAT AAACATAAAG TCTCCGTGAA 

3460 3470 J4d0 3 4 9 0 

* * * * 

GTAGACAAAG 7TGT7A7AGA GGACAGATTG GTTATACTCA 

CA7C7GT77C AACAATATCT CCTGTGTAAG CAATATGAGT 

3510 3520 3530 3540 

* • • • 

7A7A7G7GG7 AAAGCT7G7A 7GAAGC7A77 GGATAGATG7 

A7ATACACCA TTTGGAAC AT ACTTCGA7AA CCTA7C7ACA 

356C 3570 3530 3590 

TTGTCAAG7C TAATGTAGAT ATGGTTAGTC TTGAAAAGTC 

AACAG7TCAG ATTACATCTA TACCAATCAG AACTTTTCAG 

3610 3620 3630 364C 

* •* » * 

GAGC77G77A AAuAl-A ; AA » ^'jAIAjALjT AAA v jAu' — T'^ 

C7CGAACAA7 7TC7C7ATTA AG7A7C7GCA TT7CTCGAAC 

3660 3670 3630 3690 

* ♦ • * 

ACC7AAAGTA AAGAAAGATG TCTCGAATGT ACATAAGGCA 

TG G ATTTC AT 7TGTT7GTAC AGAGC77ACA 7G7ATTCGGT 

3710 372 0 3730 37 4 0 

* * * * 

ATGA7A77GA GTTAG7CAAG 77GC7T77GA AAGAGGATCA 

7 ACTA 7 AAC 7 CAATCAGTTC AACGAAAAC7 77CTCC7AGT 

376 0 3770 3 7 3C 379 0 



PCT/US97/ 1 3994 

¥ 

CGCGTCTGCT 

33CC 

A7G77GGAGG 
7 AC AAC CTCC 

3 3 50 

7AC7C7CTA7 
A7GAGAGA7A 

34 0 0 

CATGAATATG 
G7AC7TATAC 

3450 

ATTGGACG77 
TAACC7GCAA 

3500 

AGCTTGCTAA 
7CGAACGAT7 

3550 

AAAGAGATTA 
T77CTCTAAT 

36C0 

ATTGCCGGAA 
TAACGGCCT7 

3 650 

G77TGGAGG7 
CAAACCTCCA 

3700 

C77GACTCGG 
GAAC7GAGCC 

3 7 5 0 

CACCAATC7A 
GTGG77AGAT 

3 8 00 
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GA7GA7GCG7 G7GC7CTTCA 777CGC7G77 GCA7A7TGCA A7G7GAAGAC 

CTAC7ACGCA CAC3AGAAG7 AAAGCGACAA CGTA7AACG7 7ACAC77C7G 

3 3 13 392 0 3 93 C 3 94 0 3350 

CGCAACAGAT CTTTTAAAAC 77GA7C7TGC CGA7G7CAAC CA7AGGAA7C 

OCG77G7C7A GAAAA77T7G AAC7AGAACG GC7ACAGTTG G7A7CC77AG 

3 36: 3 3~0 3 3 80 3 3 9 3 3 3 0 3 

» V * # « 

CGAGGGGATA 7ACGG7GC77 CA7G77GCTG CGA7GCGGAA GGAGCGACAA 

GC7CCCC7A.7 A..GCCACGAA G7ACAA.CGAC GC7ACGCC77 1 CC7CGGTG77 

3910 392 0 393C 394 0 3953 

7TGA7AC7A7 C7C7A77GGA AAAAGGTGCA AG7GCATCAG AAGCAACTTT 

AAC7A7GA7A GAGA7AACC7 7777CCACG7 7CACGTAG7C 77CG77GAAA 

3960 39"3 3930 3990 4000 

g g aa ggta:a ACCGCACTCA 7GA7CGCAAA AC aagccact atggcggttg 

CC77CCA7C7 7G3CG7GAG7 AC7AGCG777 TGTTCGGTGA TACCGCCAA0 

4C10 4000 403C 4040 4050 

AA7G7AATAA 7A7 2CCGGAG CAA7GCAAGC A77C7CTCAA AGGCCGACTA 

TTACAT7A77 A7AGGGCC7C G7TACGT7CG 7AAGAGAGTT TCCGGCTGA7 

4060 4CTC 4080 4090 410 3 

TGTG7AGAAA 7AC7AGAGCA AGAAGACAAA C G AG AAC AAA T7CCTAGAGA 

ACACA7C777 A7GA7C7CGT 7C77CTG777 GCTC7TG77T AAGGATCTC7 

4110 412 0 4130 4140 4150 

* • * • • 

TGTTCC7CCC 7C777TGCAG TGGCGGCCGA TGAATTGAAG ATGACGCTGC 

ACAAGGAGGG AGAAAACGTC ACCGCCGGCT AC77AlACTTC TACTGCGACG 

416 0 417 0 4180 419 0 4200 

TGGA7C7TGA AAATAGAGG7 A7C7A7CAAG 7C77A7T7C7 TATA7GTTTG 

AGC7AGAAC7 T7TATC7CC A TAGA7AGTTC AGAA7AAAGA ATA7ACAAAC 

4210 4220 4230 4240 4250 

* * * * ♦ 

AA77AAA777 A7G7CC7C7C 7 A 7 7 AGG AAA C7GAG7GAAC TAA7GATAAC 

TTAA7TTAAA TAG AGG AG AG A7AA7CCTT7 GACTCACTTG ATTAC7AT7G 

425 0 42^0 428 0 4290 430 j 



7ATTCTTTG7 G7CG7CCA0T G7T7AGTTGC AC7TGCTCAA CG7C7777TC 
ATAAGAAACA C AG C AGG 7 3 A CAAA7C AACG 7GAACGAG77 GCAGAAAAAG 

431 3 4320 4 3 31 4340 435 0 



WO 98/06748 PCT/US97/ 13994 

12/34 

CAACGGAAGC ACAAGCTGCA ATGGAGATCG CCGAAATGAA GGGAAGATGT 

GTTGCCTTCG TGTTCGACGT TACCTCTAGC GGCTTTACTT CCCTTGTACA 



4 3 60 

GAGTTC AT AG 
CTCAAGTATC 



* 

TGACTAGCCT 
ACTGATCGGA 



43 80 

CGAGCCTGAC 
G C T C G A T G 



439C 

CGTCTCACTG 
GC A\jAGTG AC 



44CC 

GTACGAAGAG 
CATGCTTCTC 



4410 442G 4430 444C 4450 

• * » 

AACATCACCG GGTGTAAAGA TAGCACCCTT CAGAATCCTA GAAGAGCATC 

*■ ioTAoTvjo*. llAlATTTCT ATCGTGGAAA gtcttaggat cttctcgtag 



4 4 6 0 

AAAGTAGACT 
TTTCATCTGA 

451C 

CATCGGACTC 
GTAGCCTGAG 



44 7 0 

AAAAGCGCTT 
TTTTCGCGAA 

4520 
* 

CTTATCACAA 
G AA T AG TG TT 



44S0 

TCT AAAACCG 
AGATTTTGGC 

4 5 30 

AAAACAAAAC 
TC T T T T T T G 



4490 

GTATGGATTC 
CATACCTAAG 

454C 

TAAATG ATCT 
A T T T A C T AG A 



4500 

TCACCCACTT 
AGTGGGTGAA 

4550 

TTAAACATGG 
AA.TTTGTACC 



456 0 

TTTTGTTACT 
AAAACAATGA 

4610 

GAAACGATTC 
CTTTGCTAAG 

4 6 60 

GTGAGGACTT 
CACTCGTGAA 



4570 

TGCTGTCTGA 
ACGACAGACT 

4620 

TTCCCGCGCT 
AAGGGCGCG A 

4670 

GACTCAACTG 
CTGAGTTGAC 



4580 
* 

CCTTGTTTTT 
GGAACAAAAA 

4630 

GTTCGGCAGT 
CAAGCCGTCA 

4680 
* 

GCTTGCGGAG 
CGAACGCCTC 



4590 

TTATCATCAG 
AATAGTAGTC 

4640 

GCTCGACCAG 
CGAGCTGGTC 

4690 

AAGACGACAC 
TTCTGCTGTG 



4600 

TGGAACTCGG 
ACCTTGAGCC 

4650 

ATTATGAACT 
TAATACTTGA 

4 7 00 

TGCTGAAGAA 
ACGACTTCTT 



47 10 472C 4^30 4740 4750 

* * * * ♦ 

ACG ACT AC A. K AA G AA C C A/*A GGTACATGGA AATACAAGAG ACACTAAAGA 

TGCTGATGTT TTCTTCGTTT CCATGTACCT TTATGTTCTC TGTGATTTCT 




AGGCCTTTAG 
TCCGGAAATC 



TGAGGACAAT 
ACTCCTGTTA 



4780 

TTGGAATTAG 
AACCTTAATC 



4790 

GAAATTCGTC 
CTTTAAGCAG 



4 300 
GGACTGTCTA 



431C 482 0 4330 434 C 485 0 

* » » . 

TCGACTTCTT CCACATCGAA ATCAACCGGT GGAAAGAGGT CTAACCGTAA 

AGCTGAAGAA GGTGTAGCTT TAGTTGGCCA CCTTTCTCCA GATTGGC ATT 



WO 98/06'48 PCT7US97/1J9SU 



486C 437C 493C 489: 



4,0c ^ / 



ACTCTCTCAT CGTCGTCGGT GAGACTCTTG CCTCTTAGTG TAATTTTTGC 

TGAGAGAGTA GCAGCAGCCA CTCTGAGAAC GGAGAATCAC ATTAAA-AACG 

4910 492C 493: 4943 495: 

» » * » • 

TGTACCATAT AATTCTGTTT TCATGATGAC T3TAACTGTT TATGTCTATC 

ACATGGTATA TTAAGACAAA AGTACTACTG ACATTGACAA ATACAGATAG 

4 960 4 97: 4 93 0 49 9 0 500 0 

» ♦ * » * 

GTTGGCGTCA TATAGTTTCG CTCTTCGTTT TGCATCCTGT GTATTATTGC 

CAACCGCAGT ACATCAAAGC GAGAAGCAAA. ACGT AGG AC A CATAATAACG 

5010 5020 50 3 2 504 0 5050 
***** 

TGCAGGTGTG CTTCAAACAA ATGTTGTAAC AATTTGAACC AATGGTATAC 

ACGTCTACAC GAAGTTTGTT T AC AA.C A T TG TT AAA C T T G G TTACCATATG 

5 0 6 0 5 0 7 0 5 0 3: 5093 5100 

AGATTTGTAA TAT A T ATT T A T G T A CATC AA CAATAACCCA TGATGGTGTT 

TCTAAACATT AT A TAT AAA T ACATGTAGTT GTTATTGGGT ACTACCACAA 

5110 512 0 513 0 5140 515 0 

* # * * * 

ACAGAGTTGC TAGAATCAAA GTGTGAAATA ATGTCAAATT GTTCATCTGT 

TGTCTCAACG ATCTTAGTTT CACACTTTAT TACAGTTTAA CAAGTAGACA 

5160 5170 518 0 5190 5200 

* * * • * 

TGGATATTTT CCACCAAGAA CCAAAAGAAT ATTCAAGTTC CCTGAACTTC 

ACCTATAAAA GGTGGTTCTT GGTTTTCTTA TAAGTTCAAG GGACTTGAAG 

5210 5220 5230 5240 525C 

TGGCAACATT CATGTTATAT GTATCTTCCT AATTCTTCCT TTAACCTTTT 

ACCGTTGTAA GTACAATATA CATAGAAGGA TTAAGAAGGA AATTGGAAAA 

5260 5 2^0 523 0 5290 5300 

* *r * • * 

GTAACTCGAA TTACACAGCA AGTTAGTTTC AGGTCTAGAG ATAAGAGAAC 

CATTGAGCTT AATGTGTCGT TCAATCAAAG TCCAGATCTC TATTCTCTTG 

5 310 5 3 2 0 5 3 3 0 5 3 4 0 5 3 5 0 

*■ * * • • 

A OTGAGTGGG CGTGTAAGGT GCATTCTCCT AGTCAGCTCC ATTGCATCCA 

TCACTCACCC GCACATTCCA CGTAAGAGGA TCAGTCGAGG TAACGTAGGT 

5360 537 0 539 0 539 0 5400 



AC ATTTGTGA ATGAC ACAAG TTAACAATCC 
TGTAAACACT TACTGTGTTC AA T TG TT A G G AAACGTGGTA AAGACCCACG 
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14/34 y 

5410 542C 5430 5440 5450 

• • * * * 

ATACATGGAA ACTTCTTCGA TT3AAACTTC CCACATGTGC AGGTGCGTTC 
TATGTACCTT TGAAGAAGCT AACT7T3AAG GGTGTACACG TCCACGCAAG 

5450 547C 5430 549C 5500 

• ♦ » * * 

GCTGTCACTG ATAGAGCAAG AGACT3AAAG CTTTCACAAA TTGCCCTCAA 
CGACAGTGAC T ATCTGGTTC TCTGACTTTC GAAAGTGTTT AAv_GGGAGTT 

5510 5520 5530 554C 5550 

***** 

ATC77CTGTT TCTA7CGTCA 7GAC7CCATA TC7CCGACCA C7GGTCATGA 
TAGAAGACAA AGATAGCAG7 ACTGAGGTAT AGAGGGTGGT G AC C AG TACT 

5560 5570 5580 5590 5600 

***** 

GCCAGAGCCC ACTGATTTTG AGGGAATTGG GCTAACCATT TCCGAGCTTC 
CGGTCTCGGG TGACTAAAAC 7CCCTTAACC C G A T T GG T AA AGGCTCGAAG 

5510 5620 5630 5640 5650 

• • * » • 

TGAGTCCTT Z TTTTTGATGT CCTTTATGTA GGAATCAAA7 TC77CC7TCT 
ACTCAGGAAG AAAAACTACA GGAAATACAT CCTTAGTTTA AGAAGGAAGA 

5660 5670 5630 5690 5700 

***** 

GAC7TGTGGA TCCAGCCTGC 7TCACAAGGC TCACCAGGTT GTAGTCTCCA 
CTGAACACCT AGG7CGGACG AAGTGTTCCG AGTGGTCCAA CATCAGAGGT 

5710 572D 5730 574C 575C 

• • • • * 

AAAATATCA^ GGAATTGTAA CCAAAAACAA TCC AGACAGA ACCTGTGATA 
TTTTATAGTA CCTTAACATT CGTTTTTGTT AGGTCTGTCT TGGACACTAT 

5760 5770 5780 5790 5800 

***** 

GACCCAAGGT TCTTGCCACA GTGATCCGGG 77CGTTAATA ACAGCAACTA 
CTGGGTTCCA AGAACGGTGT CACTAGGCCC AAGCAATTAT TGTCGTTGAT 

5 r 1 n 5820 5 R 3 0 5340 5850 

• . * * * 

TGTCCGGGTG AGGACTGGAG ACGAAGCAAA CGTCTTTCCT TTGTGTTACC 
ACAGGCCCAC TCC7GACC7 0 TGC77CG777 GCAGAAAGGA AACACAATGG 

5 860 58^0 5880 589 0 5900 

• • * * * 

TTCTCTCTGA TATTAGTGAC AAACCAACGT OAACTATCAG TGGACACT7C 
AAGAGAGACT ATAATCACTC TTTGGTTGCG G77GA7AGTC ACCTGTGAAG 

59 i: 592 0 593 0 594 0 5950 

» • • * * 

7TTGGTAAGC GCAAAGCAAG CGGGAAAAAC AA7CATCAGC GTCGAGTCCT 
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AAuAOCATTGG C0TTTC3TTC GCCCTTTTTG TTAGTAGTCG CAGCTCAGGA 

59 60 5 97C 59 9 0 5 99: 6 0 00 

GAG3AAAATC ATCAATTTCA TAGGGGTACT TGGCGTTCAA. GTOTTTTGAA 
OTCCTTTTAG TA3TTAAA.GT ATCCOCATGA ACGGCAAGTT CAoAAAlA 1 - TT 

6Ci: 6220 603C 5042 605C 

• * * • • 

7CCACTATGA TOAGAGGTCT ACAGTGTTGA AACCC77CAA 7GGACTGTGG 
AGGT jATACT AjTCTCGAGA 7G7CA2AAG7 77GG ; jAAG77 acctgacacc 

60 6 0 6073 6 C 3 :■ 609 0 6 10 0 

AAAOGCCGAlA AACG 1 — GCCAC CGAAGGATGG AAA.T7CAGGA 7TAGGGAAA-A 
TT73CGGGTT 77GC3CGG7G GCTTCCTAGG 777AAG7C07 AATCCCTTT7 

6 1 1 : 6120 613 0 6140 6150 

GCTCATATTG CAG73CACAA GTAGCCCATT AGATGAGTGA AA7GCAGGCA 
CGAG7A7AA0 GTOA3GTGTT CA7CGGG7AA 7G7AC7CAC7 TTACG7CGGT 

6160 5 1 " 0 6 1 8 0 61 9: 6200 

ATTAG7TTAG GC AA7AC7CT GAAAC7G7GA TCTTTGATTA CTTC OTGTTC 
TAATCAAATC CGTTATGAGA C77TGAGAG7 AGAAACTAAT GAAGGACAAG 

6210 6220 623 0 6240 6250 

* » * * » 

TGGTGCCCGC AGCTTTGAAG TTTTAAGCAT GTCACCAAAC TTTTCAACTC 
AGGACGGGCG TCGAAACTTC AAAATTCGTA CAGTGGTTTG AAAAGTTGAG 

6260 62~0 6230 6292 6302 

***** 

TGCTGTTAGA G7GGGTTGTA CCCTGATCAG AGAG7CAATC 7C7TG7GCTG 
ACGACAATCT CACCCAACAT GGGAOTAGTG 7G7GAGTTAG AGAAGACGAC 

63 10 632 0 633 0 6340 63 5 C 

***** 

CAAATTACAA GT7GAAG7TT TCCGGCTTAA 7AGAACAACA AGTATGTGGA 
GTTTAATGTT CAACTTCAAA AGGGCGAA7T A7C77GTTG7 TGATACACCT 

6360 6370 6380 6390 6400 

CCAACTACAC 77AGT7A7CT TAACAAGTCC A7GT7CT7C7 ATTC AATCTG 
GGTTGATGTG AA.TCAA7AGA A T T G TT C A GG TAOAAGAAGA TAAOTTAGAC 

6 4 10 5 4 2 0 6 4 3 0 6 4 4 3 o 4 5 

***** 

CCCGACGCGA CC AATTGC AT 77CCATC7GA TGGATTTAAA CGTATAGTCG 
GGGGTGCGGT GGTTAACGTA AAGGTAGACT AGGTAAATTT GCATATGAGC 

64 6 0 647 0 6 480 6 49C 6 50 0 
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AG3A7GA7AA G77GGAAC77 CAAGCATAGT C7CGAAAG7A GTGTCGTTC; 
7C77AC7A7T GA-A^GTTGAA .j777G7ATCA G AGGTTTG AT CACAGCAAiG™ 



7360 7 C ^ 0 7G3C 7C9C 710: 

C7ACATGAAG AAGTAGATAG ATAAAGAGAT CCGGTGAAAC AAGTACAGGA 

GA7G7ACTTG T7CATCTAT7 7A7T7CTCTA GGC C ACTTTG TTGA7GTCCT 

71IC 712 0 713 0 714 0 715C 



7ACTTACCAA AA7A7ATTGA ACA07GA7TT C7GCAGCTGC AATCCA.AAAA 

A7GAA7GG7T 77A7ATA-AC7 7G7GACTAA-A GACGTCGA^G TTAGGTTTTT 

7160 7170 719C 719C 7203 

• * * * * 

rCATTCAAC 'A ACGCAGTCTT TTGCCTAACC 

AACCTA77TC TGG7AAGT7G 77ACA7GAA7 7GCGTC AGAA AACGGATTGG 

7210 7220 7230 7240 7250 



77GA7CG7TT TAGGAGTGGA TCCTTCATAG 7AAACACGAT CAGGACCATA 
AAC7GGCAAA ATCCTCACC7 AGGAAG7ATC ATTTGTGGTA GTGCTGGTAT 

7260 727 0 7280 7290 7300 



C77GG7AGAA GG7TTCTCTC AAGGTTTCCA 7CGCCATGAC CATAACAGTC 

GAACCATCTT GGAAAGAGAG TTCC AAAGGT AGCGGTACTG GTA7TGTCAG 

7310 7320 7330 7340 7350 

« * * * » 

CTGCAGTGAA TTCTAAGAAA AATGTAAAAA ATTTTGGCCT AAAGTCATAA 

GACGTCACTT AAGATTCTTT TTACATTTTT TAAAACCGGA TTTGAGTATT 

7360 73"0 7380 739 0 7400 

<r » » * » 

77C77AACA7 ACGAAACCA7 GGAGAACTCC A7G7CTAAA.A AATAAAGGCT 

AAGAA7TGTA 7GCTTTGG7A CC7C7TGAGG 7ACAGATTT7 TTATTTCCGA 

7410 7420 743 0 7440 7450 

* • • • • 

AAAGCT7T77 GGCGACAGAA GCAGATAAAT CCATTCAAAA CACATAAACT 

77TCGAAAAA CCGCTGTCT7 CG777ATTTA GGTAAGTT77 GTG7ATTTGA 

-? a c r-. -r * -! n *~f A Q r. 1 C r. 

/ *i O W / ■* O ^ , -t J ^ ' V ^ 

# » » • » 

GATTTGT7A7 7TG7CAC7A7 GAGT7ATGA7 777GAACA7T 7CC AGATGCA 

7510 752C 753 0 754C 

» * ♦ • 

AA77CAAAAC TGGAGAA77G TCAGATCGGG 7GTGGC7AGT AGAAGCTT 

T7GA377T7G ACC7CTTAAC AG7C7AGCCC ACAC7GA77A 7C7TCGAA 
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TCCTTCTC A.A. CGTCTCGTAC T A 3 A. 3 A C C T T T 3CTGCCCTC T A_A Tj j *-\_rv C A 

GGGAG ATTACGTTGT 

♦ * * • » 

3 3^\ jT w — -_j ^ ^ T3TT3 A. 3 0 T ATT _ 3 - .-^r^-^-w - .-vl.^a - — — . 

_; _j . _ A. 3 3 T G : _ GG.**.A.\jAjAGT GGAGTA'-jGGA. TA -:A_AA.TTTT 'jTGTTG'-'GA i 



£550 657: 6530 6590 6600 

AC GC A-ATCG ATGATCATCA AT CO AC A- AA3 TA.'- A^'_ A.AuA3T A.C ACTGTTTT 

GTGGGTTA-AG TACTAGTAGT TA3GTGTTTG A ~ _ T r _: . T . 3 A TGTGACAlAAA 

6 610 652 0 6631 6640 655 0 

GAA.GCACTGG AATCAACAA3 A33TTTA3TT A^ATA-AGCAGG CATACGGTAA 

; _ XT _ '.jjGG A^_3'_ TTA.-_>T * GTTG TGGAA^AT'jrt-A TTA * * ^ ■ ^ i j C ^TATG^ >ATT 

6650 667 C 658 0 6690 570 0 

TACCTCTAAG GCTGGGACAT T3AAACCTTG TGTGC ATC AT C T 3 A^A C 0 C G A 

ATGGAGATTG GGACCGTGTA AGTTTGGAAC ACACGTAGTA GAGTTGGGCT 

6 7 1 : 6723 6733 6 7 4 C 6750 

* * * # # 

GTTTTTATCG GTTATTTCTC CATCCCCACC TCC ACGAGTG CTACCATTTC 

C AAA_AA TAG G GAATAA-AGAG GTAGGGGTGG AGGTGCTCAC GATGGTAAAG 

6760 677: 5790 6790 6900 

* » * * » 

CGAAGTCAGA ATTTTCGTCG TCTTCAATCG ACCCGTTACT GTTACCCACT 

GCTTGAGT3T TAlA^AAGGAGC AGAAGTTAGG TGGGCAATGA CAATGGGTGA 

6 S I 0 6 3 2 0 6 3 3 0 6 S 4 0 6 3 5 2 

CCCTGAACCT CTAAACCATT ATCTCTCT3T ACTTTCACAG ATGCATGTGA 

GGGAGTTGGA GATTTGGTAA TAG AG AG AG A TGAAAGTGTC TACGTACACT 

6860 6 37 0 6330 5890 6930 
***** 

3ACATAATCA GTAGCTTCTT GGGGTTGTTG CGTCCTCTGT GTATTCGAGG 

GTGTATTAGT CATCGAAGAA CCCCAACAAC GGAGGAGACA CATAAGCTCC 

€910 6920 6930 5940 695C 
*»»*•* 

AACTAGCGGG ATATTCTATT A 3 G G A T G AA 3 AAGCAGCATG ATCAGTAACA 

TTGATCGCCC TATAAGATAA TGCCTACTTG TT3GTCGTAC TAGTCATTGT 



i960 597 2 5 93 0 699 0 



TTATCAGATG TCGATTTCAC TT 33 AAA TAG AA3TCCACAT TTCTTATAGA 
AATAGTCTAC AGCTAAAGT3 AAGGTTTATG TTGAGGTGTA AAGAATATCT 

7010 7020 "C3C 7C4 0 "05 0 



WO 98/06-M8 



PCTUS97/ 13994 

24/34 

* - - - - - i -\ c r. 

• • » » • 

GTGACTTTC7 AACTATGGCT GAAATTGCAG AACGAAAAAG ACTTTCCATT 
CACTGAAAGA TTGATAGCGA 7TTTAACG7 7 TTGCTTTTTG TGAAAGGTAA 



ATGAAA7CCA AAATGGAAAT 7TATCTCTCT 



AAAGTGAACT TACTTTGGGT TTTACCTTTA GATAGAGAGA AGAAGAAGAG 

110 12 2 12 j 14 0 15C 

* * • • * 

7TTTA.C7A. - _ _ A - i T C J A TGGCTTTCC 2 7 2CTCTAGCT 722CTAGCTC 

AAAATGATGG A GG T AAA. G G T A 7 7 G AAA GGG A.GGAGATGGA AGGGATCGAG 

16 0 17 7 18 j 19 C 200 

TTTT7AATT? CTAGAATATT CTTTTCTTAG TCTGTAATTA TCTATAGCTC 
AAAAGTTAAA GAT^TTATAA GAAAAGAATC AGACATTAAT AGATATCGAG 

210 22j 2 3 3 24 1 250 

***** 

AATTTCTAAj A_A jAACTTA TGTAAGGCGG 7TTTCTGTAA TGGATAATAG 
TTAAAGATTC T 3T7TTGAAT ACATT7CGC 7 GAAAGACATT ACCTATTATC 

2 £ C 2-7 : 23 C 29 0 300 

7AGGACTG2 2 TTTTCTGATT CGAATGACAT 7AGCGGAAGC AGTAGTATAT 
ATC77GA77 7 AAAA 7 A 7 T AA G7TTA7TG7A G77G7C77CG 77A7CA7A7A 

313 32C 33 0 34 0 350 

* * ♦ * * 

GCTGCATCGG 2GG7GG7A7G ACTGAATTTT 777CGCCGGA GACTTCGCCG 

353 370 3 80 3 90 4CC 

* » » * • 

GGGGAGA77A C77CAC7GAA ACGCC7A7CG GAAACAGTGG AA7CT A7C7T 
CGCCTCTAGT GAAGTGAC77 TGCGGATAGC C77TG7GAGC TTAGATAGAA 

410 420 43 0 440 450 

CGATGCGTCT TTGCCGGAG7 TTG ACTACTT CGCGGACGCT AAGCT7G7GG 
GCTACGCAGA AACGGCCTCA AACTGATGAA GCGG77GCGA TTCGAACACC 

460 4^0 49: 490 500 

***** 

TTTC 7GGC2C GTGT AAGGAA ATTCCGGT 7 7 AC 7GGTGCAT 777G7CGGCG 
AAAGGCCGGG 0 A 2 A 7 T 7 777 TAAuGCC A'-j * ^j^Ln^'oTA .iAAlAG^l'o ; _ 

510 520 S3C 54 0 550 

***** 

AGGA3TCCGT 7C777AAGAA TTTG77C7G7 3GTAAAAAGG AGAAGAATAG 
TCCTCAGG7 A AGAAAT7C77 AAACAAGACG 77 A77777C 7 TCTTCTTATC 



WO 9H, 06^48 
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56: 



53 3 



59: 



6o: 



TAGTAAGGTG GAATTGAAGG AGGTGATGAA AGAGCATGAo GTGAGCTATC 



/A _ „ 



1TACTC 



:gata c 



ATGCTGTAA.T 
TAC GA 7 ATTA 



TATTG GCTTATTTGT ATAGTGGTAA AGTTAGGCC" 



3AATAAA ! 



:att 

69C 



TGA 7GTAAAG AT 7TGTGTGT TTGTGTGGA2 AATGACTGCT CTCATGTGGC 



7i: 



"'ACA.CACACA AACACACCTG TTACTGACGA GAGTACACC 



720 



"3 j 



rG 



GCCA 3CTGTGGCAT 



AACAT 7CGGT 7GACA7CGTA AGGA2CAA2T 
7 ^ 



74: 



JTTTTGTAC AC ATCATTTA 
'AAAACATG TGTAGTAAAT 



7£ :' 



a n n 



PTTCAC 



:tgaattg gttgacaagt 



:ttaa: 



7TCAGAGACA CCTACTGGAT 
AAGTCTCTGT GGATGACCTA 



8 30 



3 40 



850 



ATTGTTGACA AAACTGCAGC AGACGATGTA ATGATGGTTT TATCTGTTGC 
TAAGAACTGT TTTGACGTCG TCTGCTACAT TATTACCAAA ATAGACAACG 



38 0 



AAACATTTGT jGTAAAGCAT GCG AGAG AT r 



890 



90C 



:tttcaagc tgcattgaga 
'tgtaaaca ccatttcgta cgctctctaa. cgaaagttcg acgtaactct 



9i: 



92 0 



930 



940 



950 



TTATTGTCAA GTCTAATGTT GATATCATAA CCGTTGATAA AGCCTTGCCT 
AATAA.GAGTT CAGATTACAA CTATAGTATT GGGAACTATT TCGGAACGGA 



96C 



97 0 



930 



99C 



1000 



:atgacattg taaaacaaat tactgattca ogaggggaac ttggtctaca 

j-TC'jCCTTG AACCAGATGT 



fTAOTGTAAC 

io:: 



A * TT . GTTTA 

1020 



.ATT AAGT 

10 3 C 



1040 



1050 



jGGCCTGAA AGCAACGGTT TTGCTGATAA AGATGTTAAG AGGATACATA 



^cccggact: 

106: 



TCGTTGCCAA AAGGA7TAT" 



107C 



103C 



GGGCATTGGA TTCTGATGAT GTTGAATTAC 



"GTACAATTC TCCTATGTAT 
10 9 0 110 0 

.\'^A- u j\TGTT GGTAAGAGAG 
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1 ~ 23 33 43 5C 

• • • * * 

7CGA7C777A ACCAAA7CCA G77GA7AAGG 7C7C77CGT7 GATTAGCAGA 

AGC7AGAAA7 tggtttaggt caacta77CC agagaagcaa ctaatcgtct 

6 : ?: so 90 ioo 

* » » ♦ * 

GA7C7C777A A777G7GAA7 77CAA77CA7 CGGAACCTGT TGA7GGACAC 

C 7 AG AG AAA 7 7 AAA C A C 77 A AAG77AAGTA GCC77GGACA AC7ACC7G7G 



CACCA77GA7 GGA77CGCCG A77C77A7GA AA7CAGCAGC AC7AG777CG 

G7GG7AAC7A CC7AAGCGGC 7 AAGAA7 AC 7 77AG7CG7CG 7GA7CAAAGC 
7ID GFA D S Y E ISS 7SF> 

16 0 17 0 18 0 190 20C 

* « » • » 

7CGC7ACCGA TAACACCGA 0 TCC7C7A77G 777A7C7GGC CGCCG AACAA 

AGCGATGGC7 A77G7GGC7C ACGAGA7AAC AAA7AGACCG GCGGC77G77 

A 7 10 N* 7 V SSI Y Y L A A F Q > 

210 220 230 240 250 

G7AC7CACCG GACC7GA7GT A7C7GC7C7G CAA77GC7C7 CCAACAGCT7 

CA7GAG7GGC C7GGAC7ACA 7AGACGAGAC G77AACGAGA GG77G7CGAA 

V 7 7 G ? D V 3 A 7 Q 7 7 > N F F> 

260 27 0 280 290 300 

• * » » » 

CGAA7CCG7C 777GAC7CGC CGGA7GA777 C7ACAGCGAC GC7AAGC77G 

GCTTAGGCAG AAAC7GAGCG GCCTAC7AAA GA7G7CGC7G CGA77CGAAC 

E S V FD5 PCDF YSD A K 7> 

3 10 3 2 3 3 3 0 3 4 0 3 5 0 

» • * * * 

77C7C7CCGA CGGCCGGGAA G777C777CC ACCGG7GCG7 777G7CAGCG 

AAGAGAGGC7 GCCGGCCC77 CAAAGAAAGG 7GGCCACGCA AAACAG7CGC 
v 7 3 7 ore v 5 f h r c v 7 3 a > 

360 37 ij 380 39 0 400 

A 3AA3C7 077 7C77CAAGAG C GC 77 7 AGCC GCCGCTAAGA AGGAGAAAGA 

T 77 C G A _j AA AGAAG77C77 GC*G AAA7CGG CGGCGA77C7 7CC7C777CT 

R S S F F K 3 A 7 A A A V. K F K D> 

410 4 2: ;30 440 450 

» * » * * 

C7CCAACAAC ACCGCCGCCG 7GAAGC7CGA GC77AAGGAG A77GCCAAGG 

GAGG77G77G 7GGCGGCGGC AC77CGAGC7 CGAA77CC7C 7AACGG77CC 
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460 4*^0 43: 49: s:: 

* * * . » • 

ATTACGAAGT CGGTTTCGAT TCGGTTGTGA CTGTTTTGGC TTATGTTTAC 

TAATGGTTCA GC_ A-AAGCTA AGCCAACAOT GACAAuAACGG AATACA-AA7G 

DYE:*/ G r G 3 V V 7 V 1 A Y V Y > 

51C 520 53: 54: 550 

AGCAGCAGAG TGAGACCGCC GCCTAAAGGA GTTTCTGAAT GCGCAGACGA 

"GGATTTCCT C AAA G A C T T A CGCGTCTGCT 



GAATTGCTGC CACGTGGGTT GCCGGCCGGC GGTGGATTTC ATGTTGGAGG 

CTTAACGACG GTGGACC 3AA CGGCCGGCCG CCACCTAAAG TACAACCTCG 

N C G H V r A C R ? A V D F M L E > 

610 62 0 63C 640 650 

TTCTCTATTT GGC7TTCATG TTCAAGATCC C7GAATTAAT TACTCTGTAT 

AAGAGATAAA CGGAAAGTAG AAGTTCTAGG GACTTAATTA ATGAGAGATA 



650 6^0 680 690 700 

• * • w * 

CAGAGGCACT 7ATTGGACGT TGTAGACAAA GTTGTTATAG AGGACACATT 

GTCTCCGTGA ATAACCTGCA AGATCTGTT7 CAACAATATC TCCTGTGTAA 

Q R H L L G V V D K V I E G 7 L> 

7 10 720 730 740 75C 

GGTTATACTC AAGCTTGCTA ATATATGTGG TAAAGCTTG7 ATGAAG *TAT 

CCAATATGAG TTCGAAGGAT TAT AT AC AG C ATTTCGAACA TACTTCGATA 

V I L K L A N I C G K A C M K G> 

76 0 l n 0 780 790 800 

m » * # * 

TGGATAGATG TAAAGAGATT ATTG7CAAGT CTAATGTAGA TATGGTTAGT 

ACCTATCTAC ATTTCTCTAA TAACAGTTCA GATTACATCT ATACCAATCA 

L D R 7 KG! I V K S N V G M V S > 

810 B2 0 6 3 0 840 35 0 

• * * * * 

GTTGAAAAGT CATTGCCGGA AGAG7T7GT7 AAAGAGATAA TTGATAGACG 

GAAGTTTTCA GTAACGGCCT TC7CGAACAA TTTCTCTATT AAGTATCTGG 

L £ K S L ? E EG V >: EI I G ?. ?. > 

850 8 70 830 890 900 

• * * # • 

TAAAGAGCTT GGTTTGGAGG TACC7AAAGT AAA G AAAC AT GTCTGGAATG 

ATTTCTGGAA GCAAACCTCC ATGGATTTGA TTTCTTTGTA CAGAGCTTAG 

K E L G L E V P K V K. K H S N > 
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91C 92C 93C 940 95 0 

• * » * * 

TACATAAGGC ACTTGAGTCG GATGATATTG AoTTAGTGAA G TTG C TTTTG 
ATGTATTCCG TGAACTGAGC C7ACTATAAG TGAATCAGT7 CAAGGAAAAC 
V H K A IDS DDI E 1 V K L L L> 

960 970 980 990 1000 

• • » » * 

AAAGAGGATC AGAGCAATGT AGATGATGCG TGTGCTCTTC ATTTCGCTGT 
TTTG7CCTAG TGTGGTTAGA TCTACTACGC ACACGAGAAG TAAAGCGAC A 
K E D K T N L O D A C A 1. H F A V> 

1010 1C20 1030 1040 105C 

• * * * * 

TGCATATTGC AATGTGAAGA CCGCAACAGA TC7TTTAAAA CTTGATCTTG 
ACGTATAACG TTACACTTCT GGCG7TGTGT AGAAAATTTT GAACTAGAAC 

A Y C N V K T A T D L L K L D L> 

106C 1070 1030 1090 HOC 

• * » » » 

CCGATGTCAA CCATAGGAAT CCCAGGGGAT ATACGGTGCT TCATGTTGGT 
GGCTAGAGTT GGTATCCTTA GGCTCCCCTA TATGCCACGA AGTACAACGA 
A G V N H R N ? R G Y T V L H V A> 

111C 112C 1130 1140 1150 

***** 

GCGATGCGGA AGGAGCCACA ATTGATACTA TGTGTATTGG AAAAAGGTGC 
CGCTACGCCT TCCTCGGTGT TAACTATGAT AGAGATAACC TTTTTCCACG 
AMR K E ? C L I L S L L EKGA> 

1160 1170 1180 1190 1200 

• • * * 

AAGTGCATCA GAAGCAACTT TGGAAGGTAG AACCGCACTC ATGATCGCAA 
TTCACGTAGT CTTCGTTGAA ACCTTCCATC TTGGCGTGAG TACTAGCGTT 
S A 5 EAT LEGE TAL M I A> 

1210 1220 1230 1240 1250 

***** 

AACAAGCCAC TATGGCGGTT GAATGTAATA ATATCCCGGA GCAATGCAAG 
TTGTTCGGTG ATACGGGCAA CTTACATTAT TATAGGGCCT CGTTACGTTC 
K n A T M A V E C N N I P E Q C K> 

1260 1270 123 0 129 0 1300 

***** 

AGC AAGAAGACAA 

GTAAGAGAGT TTCCGGCTGA TACACATCTT TATGATCTCG TTCTTCTGTT 

;t r* t l' f~* O ' r- t : £ I L E E D K > 

1310 1320 1330 1340 135C 

***** 

ACGAGAACAA ATTCCTAGAG ATGTTCCTCT CTCTTTTGCA GTGGCGGCGG 
TGCTGTTGTT TAAGGATCTC TACAAGGAGG GAGAAAACGT GACCGCGGGC 

REQ 1 P R D V ? ? £ F A V A A> 
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I3 6C 1370 133 3 1390 143 3 

ATGAATTGAA GATGACGCTG CTCGATCTTG AAAATAGAGT TGCACTTGCT 
7AC77AACTT CTA3TGCGAC GA3CTAGAAC TTTTATCTCA ACGTGAACGA 
C £ 1 K M T L L Z L E N R A 1 A> 

14IC 1420 1430 1443 1450 

***** 

CAACGTCTTT TTGGAACGGA AG3ACAAGC7 GGAATGGAGA TCGCGGAAAT 
GTTGGAGAAA AAGvjTTGGCT TEGTGTTCGA CGTTACCTCT AG3GGCTTTA 
OR L F ? T E A w A AME I A E M> 

I46C 1470 1430 149C 1S0C 

* • * • * 

GAAGGGAACA 7GTCAGTTCA 7AG7GACTAG 3CTCGAGCCT GACCGTCTCA 
CT7CCCTTGT ACACTCAAGT ATCACTGATC GGAGCTGGGA GTGGGAGAGT 
K G T CEP I V T S IE? D R L> 

1510 1520 153C 1540 1550 

GTGGTACGAA GAGA-AC AT 7 A CC3GGTGTAA AG AT AGO AC C TTTGAGAATC 
GACCATGCTT CTCTTGTAGT GG7C7ACATT TCTATCGTGG AAAGTC TTAG 

T G T y. ?. T 5 ? G V K I A ? f F I > 

15 6 0 15 7 3 15 3 3 15 9 7 15 0 0 

***** 

3TAGAAGAGC ATCAAAGTAG A3TAAAAGCG 3TTTCTAAAA CCGTGGAACT 
GATCTTCTCG TAGTTTCATC TGATTTTCGC GAAAGATTTT GGCACCTTGA 
LEE H Q S R L K A L S K T V E L> 

1610 1620 1530 1640 1650 

***** 

7GGGAAACGA TTCTTCCGGC GGTGTTCGGC A.GTGCTCGAC CAGATTATGA 
GCCCTTTGC7 AAGAAGGGCG CGACAAGCCG TCACGAGCTG GTCTAATACT 
GKR FF? RCSA VLD Q I M> 

1660 1670 1680 1690 1700 

* r » » » 

ACTGTGAGGA CTTGACTCAA C7'3GCTTGCG GAGAAGACGA CACTGCTGAG 
TGACACTCCT GAACTGAGTT GACCGAACGC CTCTTCTGCT GTGACGACTC 
N C E D L T Q LAC GEDD T A E> 

1710 1720 1730 1740 175C 

***** 

AAACGACTAC AAAAGAAGCA AAGGTACATG GAAATACAAG AGACACTAAA 
TTTGCTGATG TTTTCTTCGT TTCCATGTAC CTTTATGTTC TGTGTGATTT 
KRL 3 K K Q RYM E I 0' ETLK> 

176C 177 0 17B0 17 9 0 130C 

***** 

GAiAGGCGTTT AGTGAGGACA ATTT^-rGAATT AG'oAAAiTCG TCCCTGACAG 
CTT 3CGGAAA TCACTCCTGT TAAACCTTAA TCCTTTAAGC AGGGACTGTC 

v a f see l e l c n s s l t > 
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13 10 132 0 132 Z 184 0 185 0 

* » » • * 

A77CGAC77C TTCCACATCG AAATCAACCG G7GGAAAGAG G7C7AACCG7 

TAAGCTGAAvj AAGGTGTAGC 777AG77GGC CACC77TC7C CAGA77GGCA 

0 3 7 S S 7 S K 3 T G G K ?. 3 N R> 

1360 1S7C 1380 1390 1900 

» * * * • 

AAAC7C7G77 A7CG7CG7CG G7GAGAC7C7 TGCCTCTTAG TGTAA7T7T7 

777GAGAGAG 7AGCAGCAGC CAC7C7GAGA ACGGAGAA7C ACATTAAAAA 



AC7G7AAC7G 
TGACATTGAC 

1990 

7TTGCATCC7 
AAACGTAGGA 

2 0 40 

ACAATTTGAA 
7G77AAAC77 



TTTATGTCTA 
AAATACAGAT 

2000 

GTGTATTATT 
CACATAATAA 

2 0 5 0 

CCAATGGTAT 
GGTTACCATA 



GC7G7ACCA7 
CGACATGGTA 

19 60 
AGCAACCGCA 



GCTGCAG3TG 
CGACGTCCAC 

2060 

ACAGAT77G7 
7G7C7AAAC A 



A7AA77C7G7 
7A77AAGACA 

1970 

CA7ATAGTTT 
G7A7A7CAAA 



TGCTTCAAAC 
ACGAAG777G 

2070 

AA7ATATA77 
77A7ATA7AA 



AAAG7ACTAC 

1980 

CGC7C7TCG7 
GCGAGAAGCA 

2 0 3 0 

AAA7G7TG7A 
777ACAACA7 

2080 

7ATG7ACA7C 
A7ACA7GTAG 



2090 

AACAA7AAAA 
77G7TA777T 



2100 

AAAAAAAAAA 
77T7T7TTTT 



AAAA 
7777 
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OGCGTAACCT AAGACTAC7A CAACTTAATG ATG7TTACAA C3AT7CTC7C 

1110 112 C 1130 1140 1150 

* * m • • 

GGGCA7AC7A C707AGA73A tgcatatgct gtggattatg ctgtagggta 

GGGGTATGAT GGGA7C7A77 ACG7A7ACGA GAGG7AA7AC GACA7CGCAT 



1160 11-0 1130 1190 1200 

* * » * * 

TTGGGATGCA AAGACTACAG CAGAAC77C7 AGATCTTGCA CTTGCTGATA 

AACGC7ACG7 T77TGA7G70 G TC TTGAAG A TCTAGAACG7 GAACGACTAT 



— i. _ - 



AATTAGTAGT TTTAAGTTCC GGTATGTGCC ACGACGTACA ACGTCGGTAC 



1260 

AGGAAAGAGC 
7CC77TCTCG 

1310 

7TCTGATCTG 
AAGACTAGAC 

1360 

7CAC7AGGC7 
AGTGATCCGA 



1270 

CTAAAATTGT 
GATTTTAACA 

1320 

ACATGCGATG 
TGTAGGCTAC 

1370 

TGTGGATTTC 
ACACCTAAAG 



1230 

AGTGTCCCTT 
TCACAGGGAA 

1330 

GAAGAAAAGC 
C77C777TCG 

1330 

AGTAAGTC7G 
TGA77CAGAG 



1290 
* 

77AACCAAAG 
AATTGGTTTC 



1390 

CGGAGGAAGG 
GCCTCCTTCC 



130C 
* 

GAGC7AGACC 
CTCGATCTGG 



1400 
* 

AAAATCTGCT 
TTTTAGACGA 



13 4 0 

ACTTCAAATC 
TGAAGTTTAG 



1350 

GCCAAGAGGC 
CGG77C7CCG 



1410 
* 

TCGAATGATC 
AGCTTACTAG 



1420 
* 

GGTTATGCAT 
CCAATACGTA 



1430 
* 

TGAGATTCTG 
ACTCTAAGAC 



1440 

GAGCAAGCAG 
CTCGTTCGTC 



1450 

AAAGAAGAGA 
TTTCTTCTCT 



1460 

CCCTCTGC7A 
GGGAGACGA7 



14^0 

GGAGAAGGTT 
CCTCTTCGAA 



1430 

^TGTATCTCT 
G AC AT AG AG A 



1490 
* 

TGCTATGGCA 
ACGA7ACCG7 



1500 

GGCGATGATT 
CCGCTACTAA 



1510 
* 

7GCGTA7GAA 
ACGCATACTT 



15 2 0 

GCTGTTATAC 
CGACAA7A7G 



15 3 0 

CTTGAAAATA 
v_j AACTTTT A.T 



15 4 0 
* 

G AG77GGCC7 
C7CAACCGGA 



155 0 

GGCTAAACTC 
v_ ^GATTTGAG 



15 6 0 

0TTT77CCAA 
GAAAAAGGTT 



7GGAAGC7AA 
ACCTTCGATT 



15 3 0 

AG77GC AAT(_t 
7CAACG77AC 



15 9 0 

GACA7TGCTC 
CTGTAACGAG 



1600 

AAG7TGA7GG 
TTCAACTACC 



1610 



16 3 0 



IB 4 0 



1650 
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CAC7TGTGAG TTCCCACTGG CTAGCATCGG CAAAAAGATG GCTAATGC AC 
GTGAAGAGTC AAGGGTGACC GA7CGTAGCC GTTTTTC7AC CGATTAGGTG 

166C 16 7C 1690 165: 17CC 

* • * - * 

AGAGGACAAC AG7AGATTTG AAC3AGGCTC C7TTCAAGA7 AAAAGAGGA3 
TG7CCTGTTG TCA7CTAAAC 7TGCTCCGAG GAAAGT7CTA TTTTCTCCTC 

i 7 i- :^2C i^3G :?;: :75c 

CACTTGAATC GGCTTAGAGC ACT 07CTAGA ACTGTAGAAC TTGGAAAACG 
GTGAACTTAG CCGAATCTGG 7GAGAGATCT TGACA7CTTG AACCTTTTGC 

276C 177C 1780 179: 19CC 

* * * » 
CTTCTTTCCA CG77G7TCAG AAGT7CTAAA TAAGATCATG GATGCTGATG 
3AAGAAAGG7 GC AACAA.G7C 77CAAGATTT A7TC7AG7AC CTACGACTAC 

181 C 1820 1830 184 0 1350 

***** 

ACTTGTCTGA GATAGCTTAC ATGGGGAATG A7ACGGCAGA AGAGCGTCAA. 
7GAACAGAC7 C7A7CGAA7G TACCCCTTAC TATGCCGTCT 7GTCGCAGT7 

1360 1870 1830 1390 1900 

* * - • , 

CTGAAGAAGC AAAGGTACAT GGAACTTCAA GAAATTCTGA CTAAAGCATT 
GACT7CTTCG TT7CCA7GTA CC77GAAGTT C7TTAAGACT GATTTCGTAA 

1910 1920 1930 1940 1950 

* * * * . 

CACTGAGGAT AAAGAAGAAT ATGATAAGAG TAACAACATC TGCTCATCTT 
G7GAC7CC7A TT7CT7C77A TACTATTCTG A77G7TG7AG AGGAG7AGAA 

I960 19^0 1980 1990 2000 

***** 

GTTCCTCTAC ATCTAAGGGA GTAGATAAGC CCAATAAGCT CCCTTTTAGG 
CAAGGAGATG TA3A7TCCCT CATCTATTCG GGTTATTCGA GGGAAAATCC 

2010 2 0 20 203 0 2040 205 0 

***** 

AAATAGGTAA TTGTAT7AGG ATATATGAGG AAGAAGAGGA TTTTCTTGTA 
TTTATCCATT AACATAATCC 7ATATACTCG T7CTTCTCCT AAAAGAAGAT 

2060 207 0 2 0 80 2090 210C 

* 

ACATAGGACT CTTTCCTTTC A7CAT77GAT ATGT C AACAT ACATACAACA 

7G7A7CG7GA GAAAGGAAAG 7AGTAAACTA 7ACA377G7A TGTATGTTGT 

2110 2120 2130 214 0 215 0 

3CTG7ACCAT AAAC7TGTAT TGTTGC ACTT A C AA C 77 7 G A AGAACAGAA7 
CGACATGG7A TTTGAACATA ACAAGG7GAA. 73773AAAC7 TGTTGTGT7A 

216 0 2 17 0 
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TTATT7GAAA AAAAAAAAAA AA 
AATAAACTTT T7TTTTTTTT TT 
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5 0 

***** 

MDNSRTAFSDSNT^ Z SG3S5ZZCZ GGGMTEFFSPETS PAEIT3LKRLSETL 

* » » • • 

ESIFDASLPEFGYF AOAKLVYSGPC-CEI PVHRCI LS ARS PFFKNLFCGKK 

153 

» * • * * 

E KN S S KVE L K E//M K E H EV S YD A VM S VLA Y L Y SG KT/R P S P KDVCVC 7T0NTDC 

2 3 0 

SKVACRPAYAFLVEV LYTS FT FQ I SEDYDKFQRH LLOI LDKTAADDVMMV 

2 S C 

13 VAN I CGKAC ERL LS 3C I EI IVKSNVD I ITLDKALPHDI VKQITDSRAE 

303 

***** 

LGLQGPESNGFPDK:TvT^IHRALDSDOVELLQM1LREGHTT1DDAYALHY 

350 

***** 

AVAYCDAKTTAELLDLALADINHQNSRGY'Tv'LKVAAMRKEPKIVVSLLTK 

400 

***** 

garpsdltsdgrkalqiakp.ltrlv-dfskspeegksasndrlcieileca 

450 

***** 

E RR.D P 1 LG EA S V 3 LAMAGD D L RM K L L Y L ENR VG LA KLLFPMEA KV AMD I A 

5 00 

QVDGTSEF PLAS I GKKMANAQRTTVT^LNEAPFKI KEEHLNRLRALSRTVE 

550 

* * * * * 

LGKRF FPRCSEVLNKIMDADDLSE IAYMGNDTAEERQI-KKQRYMELQEIL 

TKAFT EDKEE YDKTrir; I S SSCS ST3KGVDKPNKL P F RK 
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Dosage effect of NPR1 on Psm ES4326 resistance 
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Dosage effect of NPR1 on growth of P. parasitica 

1000 
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